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METHODS TO IDENTIFY COMPOUNDS FOR 
DISRUPTING PROTEIN/PROTEIN INTERACTIONS 

Background o f the Invention 

The present invention relates to a novel method to identify 
inhibitors of protein/protein interactions. 

Background 

Modulation of protein/protein interactions is an attractive target 
for drug discovery and development. Potential methods by which drugs can 
regulate protein/protein interactions are numerous, including, for example, 
regulation of expression of one or more of the binding proteins, modulation 
of post-translational modification, and direct interference with the capacity of 
one protein to bind to one or more binding partners. More importantly, 
recent observations make it increasingly clear that supramolecular protein 
complexes, involving two or more binding proteins, play an important and 
essential roles in signal transduction, gene expression, cell proliferation and 
duplication, and cell cycle progression. For example, in the repair of UV 
damaged DNA, a so-called "repairsome" that contains over ten individual 
proteins is assembled into a complex which can then carry out the necessary 
repair. Likewise, gene transcription occurs through the concerted action of 
greater than twenty proteins. Signal transduction proteins, such as receptor 
protein kinases, are part of large complexes with many proteins. Contacts 
through Src homology type 2 (SH2) domains on the receptor kinases, for 
example, are noteworthy protein interaction which are part of one or more 
enzymatic cascade important for many metabolic processes. Disrupting the 
binding capacity of one or more proteins which form any of these larger 
complex is therefore an important and untapped method to control action of 
the overall complex. 

Protein/protein interactions have been discovered and 
characterized by a variety of methods: (i) standard biochemical affinity 
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methods such as chromatography or co-immunoprecipitations; (ii) gel overlay 
methods; (iii) co-purification by traditional biochemistry; and (iv) two-hybrid 
analysis [Fields and Song, Nature 340:245-246 (1989); Fields, Methods: A 
Companion to Methods in Enzymology 5: 1 16-124 (1993); U.S. Patent 5,283, 
5 173 issued February 1, 1994 to Fields, et al.]. The most recent of these 
approaches, the two hybrid method, has enjoyed broad application because of 
its relative ease of use for gene identification from cDNA fusion libraries. 
[See Chien et al.. Proc. Natl. Acad. Sci. (USA) 88:9578-9582 (1991); Dalton 
and Treisman, Cell 72:223-232 (1993); and Durfee, etai. Genes and Devel. 

10 7:555-569 (1993)]. 

The two hybrid system is based on targeting and identifying a 
protein/protein interaction through the use of a reporter system. The 
described two hybrid systems either use the yeast Gal4 DNA binding domain 
or the E. coli lexA DNA binding domain and couple this region to a 

15 transcriptional activator such as Gal4 or VP 16 that drives a reporter like 0 
galaclosidase or fflS3. 

In principle the two hybrid assay could be used for drug 
screening. [See WO 96/03501 and WO 96/03499.] In such a scenario, loss 
of 0 galactosidase or HIS3 activity would be identified after the yeast sixain 

20 is treated with a compound. In practice, however, use of the two hybrid 
system is technically undesirable for several reasons. In instances where the 
0 galactosidase or HIS3 protein arc employed as the reporter protein, a loss 
of activity is particularly difficult to detect because the expressed reporter 
protein is too long lived to be used in a high throughput mode. If a candidate 

25 binding inhibitor compound is metabolized faster than the previously expressed 
reporter protein is turned over, it is difficult to delect inhibitory action of the 
candidate drug while a reporter protein is still active. In high throughput 
screening, the loss of a positive signal, for example, 0 galactosidase or IIIS3 
is impossible to detect. Present robotocized screening and detection methods 

30 are simply not sufficiently sensitive or robust to detect loss of a signal. 
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Thus there is a need in the art to develop a rapid screening 
method that gives a positive signal, as opposed to a negative signal, when a 
protein/protein interaction is disrupted. Such a system must be capable of 
using protein interactions that are initially detected by any of the above 
5 mentioned approaches and must be sufficiently robust to detect a gain of 
function when a protein interaction is lost. In essence, the screening method 
must give a signal when an interaction is lost, not lose a signal when an 
interaction is lost. Such a system must be sensitive to subtle interactions, in 
particular ones that are caused by post-translational modification like protein 
10 phosphorylation. Finally for large scale screening, such as high throughput 
screening, the system must be manipulate such that a large signal-to-noise 
ration can be easily detected. 



Brief Summary of the Invention 
In one aspect, the present invention provides materials that are 

15 useful for the identification of compounds which inhibit interaction between 
known binding partner proteins. See Figure 1 . The invention provides host 
cells transformed or transfected with DNA comprising: (i) a repressor gene 
encoding DNA binding protein that acts as a repressor protein, said repressor 
gene under transcriptional control of a promoter; (ii) a selectable marker gene 

20 encoding a selectable^marker protein; said selectable marker gene under 
transcriptional control of an operator; said operator regulated by interaction 
with said repressor protein; (iii) a first recombinant fusion protein gene 
encoding a first binding protein or binding fragment thereof in frame with 
either a DNA binding domain of a transcriptional activating protein or a 

25 transacti vating domain of a transcriptional activating protein; and (i v) a second 
recombinant fusion protein gene encoding a second binding protein or binding 
fragment thereof in frame with either a DNA binding domain of a 
transcriptional activating protein or a transactivating domain of a 
transcriptional activating protein, whichever domain is not encoded by the first 
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fusion protein gene, said second binding protein or binding fragment thereof 
capable of interacting with said first binding protein or binding fragment 
thereof such that interaction of said second binding protein or binding 
fragment thereof and said first binding protein or binding fragment thereof 
brings into proximity a DNA binding domain and a transacting domain 
forming a functional transcriptional activating protein; said functional 
transcriptional activating protein acting on said promoter to increase 
expression of said repressor gene. 

The invention comprehends host cells wherein the various genes 
and regulatory sequences are encoded on a single DNA molecule as well as 
host cells wherein one or more of the repressor gene, the selectable marker 
gene, the first recombinant fusion protein gene, and the second recombinant 
fusion protein gene arc encoded on distinct DNA expression constructs In 
a preferred embodiment, the host cells are transformed or transfected with 
DNA encoding the repressor gene, the selectable marker gene, the first 
recombinant fusion protein gene, and the second recombinant fusion pnjtein 
gene, each encoded on a distinct expression construct. Regardless of the 
number of DNA expression constructs introduced, each transformed or 
transfected DNA expression construct further comprises a selectable marker 
gene sequence, the expression of which is used to confirm that transfection or 
transformation was, in fact, accomplished. Selectable marker genes encoded 
on individually transformed or transfected DNA expression constructs are 
distinguishable from the selectable marker under transcriptional regulation of 
the tet operator in that expression of the selectable marker gene regulated by 
the tet operator is central to the preferred embodiment; i.e., regulated 
expression of the selectable marker gene by the tet operator provides a 
measurable phenotypic change in the host cell that is used to identify a binding 
protein inhibitor. Selectable marker genes encoded on individually 
transformed or transfected DNA expression constructs are provided as 
determinants of successful transfection or transformation of the individual 
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DNA expression constructs. Preferred host cells of the invention include 
transformed S. cerevisiae strains designated YI596 and Y1584 which were 
deposited August 13, 1996 with the American Type Culture Collection 
(ATCC). 12301 Parklawn Drive, Rockville, Maryland 20852, and assigned 
Accession Numbers ATCC 74384 and ATCC 74385, respectively. 

The host cells of the invention include any cell type capable of 
expressing the heterologous proteins required as described above and which 
arc capable of being transformed or transfected with functional promoter and 
operator sequences which regulate expression of the heterologous proteins also 
as described. In a preferred embodiment, the host cells are of either mammal, 
insect or yeast origin. Presently, the most preferred host cell is a yeast cell. 
The preferred yeast cells of the invention can be selected from various strains, 
including the S. cerevisiae yeast transformants described in Table 1. 
Alternative yeast specimens include S.pombe. K.lactis. P.pastorts, 
S.carlsbergensis and C.albicans. Preferred mammalian host cells of the 
invention include Chinese hamster ovary (CHO), COS. HeLa, 3T3, CV1, 
LTK, 293T3, Rati, PC12 or any other transferable cell line of human or 
rodent origin. Preferred insect cells lines include SF9 cells. 

In a preferred embodiment, the selectable marker gene is 
regulated by an operator and encodes an enzyme in a pathway for synthesis 
of a nutritional requirement for said host cell such that expression of said 
selectable marker protein is required for growth of said host cell on media 
lacking said nutritional requirement. Thus, as in a preferred embodiment 
where a repressor protein interacts with the operator, transcription of the 
selectable marker gene is down-regulated and the host cells are identified by 
an inability to grow on media lacking the nutritional requirement and an 
ability to grow on media containing the nutritional requirement. In a most 
preferred embodiment, the selectable marker gene encodes the HIS3 protein, 
and host cells transformed or transfected with a HIS3-encoding DNA 
expression construct are selected following growth on media in the presence 
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and absence of histidinc. The invention, however, comprehends any of a 
number of alternative selectable marker genes regulated by an operator. Gene 
alternatives include, for example URA3, LEU2, LYS2 or those encoding any 
of the multitude of enzymes required in various pathways for production of 
5 a nutritional requirement which can be definitively excluded from the media 
of growth. In addition, conventional reporter genes such as chloramphenicol 
acetyltransferase (CAT), firefly luciferase, /3-galactosidase (0-gal), secreted 
alkaline phosphatase (SEAP), green fluorescent protein (GFP), human growth 
hormone (hGH), ^-glucuronidase, neomycin, hygromycin, thymidine kinase 

10 (TK) and the like may be utilized in the invention. 

In the preferred embodiment, the host cells include a repressor 
protein gene encoding the tetracycline resistance protein which acts on the tet 
operator to decrease expression of the selectable marker gene. The invention, 
however, also encompasses alternatives to the tet repressor and operator, for 

1 5 example, E. colt trp repressor and operator, his repressor and operator, and lac 
operon repressor and operator. 

The DNA binding domain and transactivating domain 
components of the fusion protein may be derived from the same transcription 
factor or from different transcription factors as long as bringing the two 

20 domains into proximity permits formation of a functional transcriptional 
activity protein that increases expression of the repressor protein with high 
efficiency. A high efficiency transcriptional activating protein is defined as 
having both a DNA binding domain exhibiting high affinity binding for the 
recognized promoter sequence and a transactivating domain having high 

25 affinity binding for transcriptional machinery proteins required to express 
repressor gene mRNA. The DNA binding domain component of a fusion 
protein of the invention can be derived from any of a number of different 
proteins including, for example, Lex A orGal4. Similarly, the transactivating 
component of the invention's fusion proteins can be derived from a number 

30 of different transcriptional activating proteins, including for example, Gal4 or 
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VP 1 6. In one embodiment of the invention, polynucleotides encoding 
binding partner proteins CREB and CBD are inserted in plasmids pVP16- 
CREB and pLexA-CBD, respectively, which were deposited with the ATCC 
and assigned Accession Numbers ATCC 98138 and ATCC 98139, 
5 respectively. 

The promoter sequence of the invention which regulates 
transcription of the repressor protein can be any sequence capable of driving 
transcription in the chosen host cell. The promoter may be a DNA sequence 
specifically recognized by the chosen DNA binding domain of the invention, 

10 or any other DNA sequence with which the DNA binding domain of the 
fusion protein is capable of high affinity interaction. In a preferred 
embodiment of the invention, the promoter sequence of the invention is either 
a HIS3 or alcohol dehydrogenase (ADH) promoter. In a presently most 
preferred embodiment, the ADH promotor is employed in the invention. The 

15 invention, however, encompasses numerous alternative promoters, including, 
for example, those derived from genes encoding HIS3, ADH, URA3, LEU2 
and the like. 

In another aspect, the invention provides methods to identify 
molecules that inhibit interaction between known binding partner proteins. In 

20 one embodiment, the invention provides a method to identify an inhibitor of 
binding between a first binding protein or binding fragment thereof and a 
second binding protein or binding fragment thereof comprising the steps of (a) 
growing host cells transformed or transfected as described above in the 
absence of a test compound and under conditions which permit expression of 

25 said first binding protein or binding fragment thereof and said second binding 
protein or binding fragment thereof such that said first binding protein or 
fragment thereof and second binding protein or binding fragment thereof 
interact bringing into proximity said DNA binding domain and said 
transactivating domain forming a functional transcriptional activating protein; 

JO the transcriptional activating protein acting on said promoter to increase 
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expression of said repressor protein; said repressor protein interacting with 
said operator such that said selectable marker protein is not expressed; (b) 
confirming lack of expression of said selectable marker protein in said host 
cell; (c) growing said host cells in the presence of a test compound; and (d) 
5 comparing expression of said selectable marker protein in the presence and 
absence of said test compound wherein increased expression of said selectable 
marker protein is indicative that the test compound is an inhibitor of binding 
between said first binding protein or binding fragment thereof and said second 
binding protein or binding fragment thereof. 

10 In a most preferred embodiment, the invention provides a 

method to identify an inhibitor of binding between a first binding protein or 
binding fragment thereof and a second binding protein or binding fragment 
thereof comprising the steps of: (a) transforming or transfecting a host cell 
with a first DNA expression construct comprising a first selectable marker 

1 5 gene encoding a first selectable marker protein and a repressor gene encoding 
a repressor protein, said repressor gene under transcriptional control of a 
promoter; (b) transforming or transfecting said host cell with a second DNA 
expression construct comprising a second selectable marker gene encoding a 
second selectable marker protein and a third selectable marker gene encoding 

20 a third selectable marker protein, said third selectable marker gene under 
transcriptional control of an operator, said operator specifically acted upon by 
said repressor protein such that interaction of said repressor protein with said 
operator decreases expression of said third selectable marker protein; (c) 
transforming or transfecting said host cell with a third DNA expression 

25 construct comprising a fourth selectable marker gene encoding a fourth 
selectable marker protein and a first fusion protein gene encoding a first 
binding protein or binding fragment thereof in frame with either a DNA 
binding domain of a transcriptional activation protein or a transactivattng 
domain of said transcriptional activation protein; (d) transforming or 

30 transfecting said host cell with a fourth DNA expression construct comprising 
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a fifth selectable marker gene encoding a fifth selectable marker protein and 
a second fusion protein gene encoding a second binding protein or binding 
fragment thereof in frame with either the DNA binding domain of said 
transcriptional activation protein or the transactivating domain of said 
transcriptional activation protein, whichever is not included in first fusion 
protein gene; (e) growing said host cell under conditions which permit 
expression of said first binding protein or fragment thereof and said second 
binding protein or fragment thereof such that said first binding protein or 
fragment thereof and second binding protein or binding fragment thereof 
interact bringing into proximity said DNA binding domain and said 
transactivating domain reconstituting said transcriptional activating protein; 
said transcriptional activating protein acting on said promoter to increase 
expression of said repressor protein; said repressor protein interacting with 
said operator such that said third selectable marker protein is not expressed; 
(0 detecting absence of expression of said selectable gene; (g) growing said 
host cell in the presence of a test compound of binding between said first 
protein or fragment thereof and said second binding protein or fragment 
thereof; and (h) comparing expression of said selectable marker protein in the 
presence and absence of said test compound wherein decreased expression of 
said selectable marker protein is indicative of an ability of the test compound 
to inhibit binding between said first binding protein or binding fragment 
thereof and said second binding protein or binding fragment thereof such that 
said transcriptional activating protein is not reconstituted, expression of said 
repressor protein is not increased, and said operator increases expression of 
said selectable marker protein. 

The methods of the invention encompass any and all of the 
variations in host cells as described above. In particular, the invention 
encompasses a method wherein: the host cell is a yeast cell; the selectable 
marker gene encodes HIS3; transcription of the selectable marker gene is 
regulated by the tet operator; the repressor protein gene encodes the 
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tetracycline resistance protein; transcription of the tetracycline resistance 
protein is regulated by the HIS3 promoter; the DNA binding domain is 
derived from LexA; and the transactivating domain is derived from VP16. 
In another embodiment, the invention encompasses a method wherein: the host 
5 cell is a yeast cell; the selectable marker gene encodes HIS3; transcription of 
the selectable marker gene is regulated by the let operator, the repressor 
protein gene encodes the tetracycline resistance protein; transcription of the 
tetracycline resistance protein is regulated by the alcohol dehydrogenase 
promoter; the DNA binding domain is derived from LexA; and the 

10 transactivating domain is derived from VP16. 

In alternative embodiments of the invention wherein the host 
cell is a mammalian cell, variations include the use of mammalian DNA 
expression constructs to encode the first and second recombinant fusion genes, 
the repressor gene, and the selectable marker gene, and use of selectable 

1 5 marker genes encoding antibiotic or drug resistance markers (i.e. . neomycin, 
hygromycin, thymidine kinase). 

There are at least three different types of libraries used for the 
identification of small molecule modulators. These include: (1) chemical 
libraries, (2) natural product libraries, and (3) combinatorial libraries 

20 comprised of random peptides, oligonucleotides or organic molecules. 

Chemical libraries consist of structural analogs of known 
compounds or compounds that are identified as "hits" via natural product 
screening. Natural product libraries are collections of microorganisms, 
animals plants or marine organisms which are used to create mixtures for 

25 screening by: (1) fermentation and extraction of broths from soil, plant or 
marine microorganisms or (2) extraction of plants or marine organisms. 
Combinatorial libraries are composed of large numbers of peptides, 
oligonucleotides or organic compounds as a mixture. They are relatively easy 
to prepare by traditional automated synthesis methods, PCR, cloning or 

30 proprietary synthetic methods. Of particular interest are peptide and 
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oligonucleotide combinatorial libraries. Still other libraries of interest include 
peptide, protein, peptidomimetic, multiparallel synthetic collection, 
recombinatorial, polypeptide libraries. 

The utility of the various aspects of the invention is manifest. 
Host cells of the invention are useful to demonstrate in vivo binding capacity 
of both known and suspected binding partner proteins in a recombinant 
system. Such an expression system permits systematic analysis of the 
structure and function of a particular binding protein, thus permitting 
identification and/or synthesis of potential modulators of the physiological 
activity of the binding proteins. The methods of the invention are particularly 
useful to identify and improve molecules which are capable of inhibiting 
specific and general protein/protein interactions. Inhibitors identified by the 
methods of the invention can then be examined for utility in vivo as 
therapeutic and/or prophylactic medicaments for conditions associated with 
various protein/protein interactions. 

Descripti on of the Drawing 
Figure 1 describes the mechanics of the split hybrid assays. 

Detailed IWription of the. Invention 
The present invention relates generally to methods designated 
split hybrid assays to identify inhibitors of protein/protein interactions and is 
illustrated by the following examples describing various methods for making 
and using the invention. In particular, Example 1 relates to construction of 
various plasmids and expression constructs utilized in the invention. Example 
2 described generation of various yeast transformants used to identify inhibitor 
compounds. Examples 3, 4, 5 and 6 address use of the split hybrid assay to 
examine CREB/CBD binding. Tax/SRF binding, CKI/CREB binding and 
AKAP 79 binding to various partner protein, respectively. Example 7 
describe general application of the split hybrid assay. Example 8 relates to 
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use of the split hybrid assay for weakly interacting binding partners. Example 
9 describes general assay methods. Example 10 addresses use of the split 
hybrids assay to identify agents that prevent receptor desensitization and drug 
tachyphylaxis. 



Example 1 
Plasmid Construction 

In the examples that follow, various plasmid constructs were 

utilized as described. To simplify discussion of the exemplified assays, this 

example describes construction of the various plasmids used in the following 

examples. For clarity, the plasmids are grouped according common features 

relating to their applications in the assays later discussed. 

I. Plasmids Encoding Reporte r Gene HIS3 
A. P RS303/lxtetop-MluI 

One copy of the let operator sequence was engineered into 
position -53 in the HIS3 promoter of pRS3 1 3 [Sikorski, R.S. eial.. Genetics 
122:19-27 (1989)] by using the polymerase chain reaction (PCR). Two 
primary PCR reactions using pRS313 as a template were performed which 
utilized a 5'-terminal oligonucleotide designated Eco47m-5* and a 3'-inner 
oligonucleotide designated Tetop internal 3' to yield a primary 5 ' PCR product 
and a 5 '-inner oligonucleotide designated Tetop internal 5 ' and a 3' -terminal 
oligonucleotide designated Nhe I 3' to yield a primary 3'-PCR product. 

Eco47 m-5' SEQ ID NO: 1 

5 '-TTGGTGAGCGCTAGGAGTCACTGCCAG 

Tetop int. 3' SEQ ID NO: 2 

5 ' -TATACTCTATCAATG ATAG AGTA ATTC ATTATGTG ATAATGCC 

Tetop int. 5' SEQ ID NO: 3 

5 ' - ATTACTCTATC ATTG ATAG AGTATATA AAGTAATGTG ATTTC) 

Nhe I 3' SEQ ID NO: 4 
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5 ' - AATTCTGCTAGCCTCTGC AAAGC 

5' and 3' inner oligonucleotides contain complementary sequence such that 3' 
sequence of the primary 5' PCR product overlaps with 5' sequence of the 
primary 3' PCR product. The 5' terminal oligonucleotide contains the 
5 restriction site EcoATTm while the 3' terminal oligonucleotide contains the 
restriction site Nhel in order to facilitate subsequent subcloning. The primary 
PCR reactions were performed with PJu DNA polymerase (Stratagene, La 
Jolla, CA) using reaction conditions described by the manufacturer. PCR 
products were isolated by Biol 01 (Vista. CA) Gene Clean m gel extraction. 
10 The primary 5' and 3' PCR products were then combined in a second PCR 
reaction and amplified using the 5'- and 3'- terminal oligonucleotides, 
Eco47m-5' and Nhel 3'. The second PCR reaction was performed with Veni 
DNA polymerase (New England Biolabs, Beverly, MA) using reaction 
conditions described by the manufacturer, except that the reactions were 

15 supplemented with 4 mM Mg 2 + . The final PCR product contained one lei 
operator sequence inserted into position -53 of the HIS3 promoter and 
nucleotides 52-48 deleted in the construction. The final PCR product was 
isolated, digested with Eco47m and Nhel and cloned into pRS313 previously 
digested with Eco47m and Nhel. The resulting plasmid was designated 

20 pRS313/lxtetop. DNA sequencing confirmed the presence of one copy of the 
tei operator sequence in pRS3 1 3/ 1 xtetop and confirmed integrity of the 
EcoAHB. and Nhel junctions. 

A Mlul restriction enzyme site was engineered into position -22 
in the HISS promoter of pRS3 1 3/1 xtetop by utilizing PCR using Vent DNA 

25 polymerase using pRS3 13/1 xtetop as template. One PCR construct was 
amplified using the 5' terminal oligonucleotide Eco47 B3-5' (SEQ ID NO: 1) 
containing an £co47m restriction site and a 3'-oligonucleotide designated Mlu 
I 3' containing a Mlul restriction site. 
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Mlu I 3' SEQ ID NO: 5 

5 ' -CGC ACGCGTCG AAG AAATCAC ATTACTTTATATA 

A second PCR product was amplified using the 3'-terminal oligonucleotide 
Nhe I 3' (SEQ ID NO: 4) containing a Nhel restriction site and a 5'- 
5 oligonucleotide designated Mlu I 5' containing a Mlul restriction site. 

M, u I 5' SEQ ID NO: 6 

5 ' CGCACGCGTATACTAAAAAATG AGC AGGCAAG 



The first PCR product was isolated and digested with EcoATOL and Mlul, 
while the second PCR product was isolated and digested with Mlul and A7i£l. 

10 These digested products were isolated and ligatcd in a triple ligation with 
pRS313 previously digested with £co47IE and Mel. The resulting plasmid 
was designated pRS313/lxtetop-MluI. DNA sequencing confirmed the 
presence of the Mlul site in pRS3 13/lxtetop-MluI and confirmed that integrity 
of the Eco47m and Nhel junctions were maintained. 

15 A pRS303/lxtetop-MluI plasmid was constructed by first 

removing the EcoAimiNhel fragment containing the altered HIS3 promoter 
from the pRS313/lxtetop-M/wI vector and ligating the isolated fragment into 
pRS303 previously digested with £co47IH and Nhel. DNA sequencing 
confirmed proper insertion of the EcoAimiNhel fragment. 

20 B. pRS303/2xtetop -I.VS? 

One copy each of the tet operator sequence was engineered into 
positions -53 and -22 in the HIS3 promoter of pRS303 [Sikorski, a al.. 
Genetics 122:19-27 (1989)]. PCR was utilized to engineer one copy into 
position -53 which resulted in plasmid P RS303/lxtetop. To insert the second 
25 copy, a Mlul site was introduced at position -22 in the HIS3 promoter using 
PCR. The new plasmid was designated pRS303/lxtetop-MluI. 

The let operator was created by annealing two complementary 
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oligonucleotides tetop-1 and tetop-2. 



tetop-1 SEQ ID NO: 7 

5'-CGCGTACTCTATCATTGATAGAGTA; 

tetop-2 SEQ ID NO: 8 

5'-ATGAGATAGTAACTATCTCATGCGC 



When annealed, the tet operator sequence contains flanking Mlul sites. Both 
oligonucleotides were phosphorylated using T4 polynucleotide kinase (Gibco 
BRL. Grand Island, NY) at 37°C for one hour and annealed by first heating 
at 70°C for 10 minutes and then cooling to room temperature. The annealed 
10 oligonucleotides were isolated and ligated into pRS303/lxtetop-A//uI 
previously digested with Mlul. The resulting plasmid was designated 
pRS303/2xtetop. DNA sequencing confirmed insertion of one copy of the tet 
operator sequence in the Mlul site. 

The LYS2 gene was digested from pLYS2 fHollenberg, S.M. 
1 5 et at. , Mol. Cell.Biol. 15:38 1 3-3822 (1995)] with EcoW and Hitidm and the 
isolated fragment blunt ended using the large fragment of DNA polymerase 
I (Gibco BRL, Grand Island, NY). Phosphorylated Sstl linkers (New England 
Biolabs, Beverly, MA) were ligated to the fragment, the fragment digested 
with Sstl, and the resulting fragment ligated into pRS313 previously digested 
20 with Sstl. The resulting plasmid was designated pRS313/LYS2. 

TheZ.yS2 fragment was removed from pRS3I3/LYS2 with Sstl 
digestion and inserted into pRS303/2xtetop previously digested with Sstl. The 
resulting plasmid was designated pRS303/2xtetop-LYS2. 

Similarly, the LYS2 Sstl fragment was inserted into 
25 pRS303/lxtetop-MluI previously digested with Sstl yield pRS303/lxtetop- 
MIuI-LYS2. 



C. DRS303/3xtetr,p-LYS7 
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Two copies of the tet operator sequence were created by self- 
annealing a palindromic oligonucleotide Tctop 2x with itself. 

Tetop 2x SEQ ID NO: 9 

5 -CGCGTACTCTATCATTGATAGACTCTAGACTCTATCAATGATAGACjTA 

The annealed oligonucleotide contained flanking Mlul sites. The 
oligonucleotide was phosphorylated. annealed, and isolated as above. The 
isolated annealed and Mul-digcsted oligonucleotide was ligated into 
pRS303/lxtetop-A*M-LYS2 previously digested with Mlul to yield 
pRS303/3xtetop-LYS2. The presence of two copies of the tet operator 
sequence in the Mlul site was confirmed by DNA sequencing. 

D pRS303/4xtetop-LYS? anrt p RS303/8xtetn p.T VS7 

Three or seven copies of the tet operator were created using 
PCR with Vent DNA polymerase as described above. Plasmid pUHC-13-3 
IGrossenandBujarg, Proc. Natl. Acad. ScL (USA) §2:5547-5551 (1992)] was 
used as template DNA using 5'- and 3'- oligonucleotides, Mlu I/Sph I 5' and 
Mlu I Sph I 3', containing an exterior Mlul restriction enzyme site nested 
internally by a Sphl restriction enzyme site. 

Mlu I/Sph 1 5' SEQ ID NO: 10 

5 ' -GCG ACGCGTGC ATGCCGTCTTCAAG AATTCCTCG AG 

Mlu I Sph I 3' SEQ ID NO: 1 1 

5'-GCGACGCGTGCATGCCCACCGTACACGCCTACTCGA 

The PCR products were separated on an agarose gel and the ladder of 
differeni sized DNA fragments was isolated, digested with Mlul, and ligated 
into the Mlul restriction site of pRS303/lxtetop-MluI-LYS2. DNA sequenc- 
ing revealed that eithei 1 three or seven copies of tet operators were inserted 
into the Mlu site of pRS303/lxtetop-M«I-LYS2 to provide either 
pRS303/4xtetop-LYS2 or pRS303/8xtetop-LYS2. 
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E pRS303/6xtetop-LYS2 and P RS303/l(W.mp -r VS9 

A Sphl restriction enzyme site was introduced at position -85 
in the HISS promoter of pRS303/3xtetop-LYS2 using PCR with Vent DNA 
polymerase as described. Plasmid pRS303/3xtetop-LYS2 was used as a 
template DNA. A first fragment was amplified using the 5 '-terminal 
oligonucleotide Eco47 m-5' (SEQ ID NO: 1) described above containing an 
EcoAim restriction site and a 3*-oligonucleotide Sph I 3' containing a Sphl 
restriction site. 



f ph 1 y SEQ ID NO: 12 

5 -CATGGCATGCAAAAAAAAAGAGTCATCCGCTAGG 

A second PCR product was amplified using the 3'terminal oligonucleotide 
Nhe I 3' (SEQ ID NO: 4) described above containing a Nhel restriction site 
and a 5'-oligonucleotide containing a Sphl restriction site. 

Sp 5! * 5 ' SEQ ID NO: 13 

5 CATGGCATGCTTAGCGATTGGCATTATCACAT 

The PCR products were isolated as described above. The first PCR product 
was digested with £co47III and Sphl, and the second PCR product was 
digested with Sphl and Nhel. Both digestion products were ligated in a triple 
ligation along with pRS303/3xteto P -LYS2 previously digested with both 
Eco47m and Nhel. The resulting plasmid was designated pRS303/3xtetop- 
SphI-LYS2. The presence of the Sphl site in pRS303/3xtetop-SphI-LYS2 was 
confirmed by DNA sequencing analysis. 

Three copies of tet operators were isolated as a single fragment 
by digesting pRS303/4xtetop-LYS2 with Sphl. The isolated fragment was 
ligated into the Sphl site of pRS303/3xtetop-5 J pAI-LYS2 to yield 
P RS303/6xtetop-LYS2. The presence of three additional copies of the tet 
operator in P RS303/6xtyetop-LYS2 at the Sphl site was confirmed by DNA 
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sequencing. 

Seven copies of tet operators were isolated as a single fragment 
by digesting pRS303/8xtetop-LYS2 with Sphl. The isolated fragment was 
ligated into the Sphl site of pRS303/3xtetop-S/?M-LYS2 to yield 
5 pRS303/10xtetop-LYS2. The presence of seven additional copies of the tet 
operator in pRS303/10xtetop-LYS2 at the Sphl site was confirmed by DNA 
sequencing. 

F- PRS313/Mlnl a nd nRS3m/Ml..I 

A Mid restriction enzyme site was engineered into position -22 
10 in the HIS3 promoter of pRS3 1 3 utilizing PCR and Vent DNA polymerase as 
noted above. Plasmid pRS313 was used as a template for these PCR 
reactions. One PCR construct was amplified using the 5' terminal 
oligonucleotide Eco47 m-5' (SEQ ID NO: 1) containing an Eco47m 
restriction site and a 3 ' oligonucleotide Mlu I 3' (SEQ ID NO: 5) containing 
15 a Mlul restriction site. A second PCR product was amplified using the 3' 
terminal oligonucleotide Nhe I 3' (SEQ ID NO: 4) containing a Nhel 
restriction site and the 5 ' oligonucleotide Mlu 1 5' (SEQ ID NO: 6) containing 
a MM restriction site. The first PCR product was isolated and digested with 
Eco47m and Mlul, while the second PCR product was isolated and digested 
20 with Mlul and Nhel. The digested products were partially purified and joined 
in a triple ligation with pRS313 which had been previously digested with 
Eco47m and Nhel. The resulting plasmid was designated pRS313/MluI. 
DNA sequencing confirmed the presence of the Mlul site in pRS313/Mlul and 
to confirm the integrity of the Eco47lll and Ntiel junctions. 

pRS303/MluI was constructed in exactly die same manner as 
pRS313/MluI except that pRS303 was used in place of pRS313. 



G. pRS313/lxtetop 

See above wherein pRS313/Ixtetop is an intermediate in the 
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construction of pRS303/lxtetop-MluI. 

H pRS313/Mlt.M xtet op and pRS10 3/MluI-l xtetop 

One copy of the tet operator sequence was created by annealing 
two complementary oligonucleotides tetop-l and tetop-2 (SEQ ID NO: 7 and 
SEQ ID NO: 8). The annealed tet operator sequence contains flanking MM 
sites. The oligonucleotides were phosphorylated using T4 polynucleotide 
kinase (Gibco BRL, Grand Island, NY) at 37°C for one hour and annealed by 
first heating at 70°C for 10 minutes followed by cooling to room temperature. 
The annealed oligonucleotides were isolated and ligatcd separately into Mlril- 
digested P RS313/MluI and P RS303yMlul. the resulting plasmids being 
designated pRS3 1 3/MluI- 1 xtetop and P RS303/MluI- 1 xtetop. DNA sequencing 
confirmed the presence of one copy of the tet operator in the MM sites of 
both plasmids. 

In order to produce plasmids bearing multiple copies of the tet 
operator, annealed oligonucleotides described above were ligated together 
overnight at 16°C. After isolation of the ligation products, they were inserted 
into the MM of P RS313/MluI. DNA sequencing analysis confirmed that one 
clone, pRS313/MluI-4xtetop, was produced which contained four copies of tet 
operator in the MM site. However, upon further examination of this clone 
it was discovered that it had been subjected to a recombination event and was 
therefore not useful for further cloning steps. Continued attempts to insert 
multiple copies of the let operator into the MM site of pRS313/MluI by 
ligating multimers of the tet operator have been unsuccessful. 

I. PRS313/Ixtetnp -Ml..f 

See above wherein construction of pRS3 1 3/ 1 xtetop-MluI was 
an intermediate in the construction of pRS303/lxtetop-MluI. 
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J- BES3J3i2jaeiQ E 

One copy of the let operator sequence was created using 
annealed complementary oligonucleotides tetop-1 and tetop-2 (SEQ ID NO: 
7 and SEQ ID NO: 8). Annealed oligonucleotides were ligated into the Mlul 
site of P RS313/lxtetop-MluI to yield P RS313/2xtetop. DNA sequencing 
confirmed the presence of two copies of the tet operator in the Mlul site. 

K. P RS303/?»t^op 

See above wherein P RS303/2xtetop was an intermediate in the 
constmction of pRS303/2x/tetop-LYS2. 

L PRS31 3/I.YS7 andnR<nn/l vqo 

The LYS2 gene was digested from pLYS2 with EcoW and 
Mindm digestion. The EcoVllHintm fragment was blunt ended using the 
large fragment of DNA polymerase I (Gibco BRL, Grand Island, NY) and 
ligated with phosphorylated Sstl linkers (New England Biolabs. Beverly, MA). 
The resulting fragment was digested with Sstl and ligated into pRS313 
previously digested with Sstl. The resulting plasmid was designated 
PRS313/LYS2. Because theLKS2 fragment was shown to have inserted into 
PRS3I3 in both orientations, plasmids with the LYS2 gene in both orientations 
were transformed separately into the yeast strain SEY62!0a_(M47«_ lcu2- 
3.112 uraJ-52 his3-A200 trpl^901 lys2-80l suc2-A9 FRobinson et aL, Mai 
Cell. Biol. 8:4936-4948 (1988)J. Both clones allowed the yeast to grow in the 
absence of lysine indicating that orientation of the LYS2 gene in pRS313 did 
not affect the expression of an active gene. 

TheLKS2 fragment was removed from pRS3I3/LYS2 with Sstl 
and ligated into the Sstl site of: 



P RS313/Ixtetop-MluI giving plasmid pRS3l3/lxtetop-MluI-LYS2, 
pRS313/2xtctop giving plasmid pRS313/2xtetop-LYS2, 
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pRS303/lxtetop-MluI giving plasmid pRS303/lxtetop-MIuI-LYS2, and 
pRS303/2xtelop giving plasmid pRS303/2xtetop-LYS2. 

II. Plasmids Encoding Reporter Gene TetR 
A. P RS306/fflS3:TetR/T e rm 
5 The 5' promoter sequence of the yeast HIS3 gene, 

encompassing nucleotides -75 to +23, was ligated to the translational start of 
TetR. In addition, the DNA sequence encoding the simian virus 40 (SV40) 
large T antigen nuclear localization signal was ligated in frame with the 
nucleotide sequence encoding the last amino acid residue of TetR. The 
10 chimeric fragment was created by the same PCR strategy as described above. 

The HIS3 promoter fragment, the primary 5'-PCR product, was 
amplified by PCR from plasmid p601 fGrueneberg.D.A., Science 257:1089- 
1095 (1992)] using a 5' terminal oligonucleotide T7 Promoter primer and a 
3'inner oligonucleotide 3'-TetR inner primer. 

15 T7 Promoter primer SEQ ID NO: 14 

5'-TAATACGACTCACTATATAGGG 

3'-TetR inner primer SEQ ID NO: 15 

5 ' -TCTAG ACTTTGCCTTCGTTTATC 

The primary 3' PCR product containing the TetR coding sequence was 
20 amplified from pSLF104 [Forsburg, Nucl. Acid. Res. 21:2955-2956 (1993)] 
with a 5'-inner oligonucleotide 5'-TetR inner primer and a 3'-terminal 
oligonucleotide 3'-TetR terminal primer. 

5'-TetR inner primer SEQ ID NO: 16 

5'CGAAGGCAAAGATGTCTAGATTAGATAAAAG 

25 3'-TetR terminal primer SEQ ID NO: 17 

5'-CGCGGATCCGCT7TCTLl I CI I NT! GG AG ACCCACTTTC AC ATTTA AG 
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An EcoRI site derived from the p601 fragment and a BamHl site in the 3'- 
terminal oligonucleotide were used in subsequent subcloning. The PCR 
products were gel-purified and amplified in a second PCR reaction with 5'- 
and 3-' terminal oligonucleotides, T7 Promoter primer (SEQ ID NO: 14) and 
5 3'-TetR terminal primer (SEQ ID NO: 1 7). The secondary PCR product was 
isolated, digested with £coRI and BamHl, and ligated into P RS306/Term 
previously digested with feoRI and BamHl. The resulting plasmid was 
designated P RS306/HIS3:TetR/Term which comprises the complete TetR 
coding sequence in frame with sequences encoding the nuclear localization 
10 signal of SV40 large T antigen. 



20 



25 



B BBS3i MnS3:TetR/T e rrn 

The construction protocol for this plasmid was the same as 
described above for subcloning a HIS3 DNA into pRS306/Term except chat 
the vector for subcloning was pRS316/Term described above. 

C PRS306/ 1 xl^.x Aon/HTSl -T>tP 

Oligonucleotides LexAop (100a) and LexAop (100b) containing 
a single copy of LexA operator were phosphorylated with T4 polynucleotide 
kinase (Gibco BRL, Grand Island. NY) at 37°C for one hour. 

LexAop (100a) SEQ ID NO: 18 

5 ' - AATTGCTCG AGTACTGTATGTACATACAGTAG 

LexAop (100b) SEQ ID NO 19 

5 - AATTCTACTGTATGTACATACAGTACTCGAGC 

Following phosphorylation, the oligonucleotides were annealed by heating at 
70°C for 10 minutes followed by cooling to room temperature. The annealed 
oligonucleotide containing 5 ' and 3 * £coRI overhanging ends was subcloned 
into pRS306/HIS3:TetR/Term previously digested with £coRI. The number 
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of copies of inserted oligonucleotide was confirmed by DNA sequencing. The 
plasmid containing a single copy of the LexA operator was designated 
pRS306/ i xLexAop/HIS3:TetR. 

D. pRS31 6/2xLexAop/HIS3:TetR 
5 The subcloning protocol for this construct was the same as 

described above for pRS306/IxLexAop/fflS3:TetR. The annealed 
oligonucleotides encoding the LexA operator included overhanging EcoKI ends 
and during ligation, the individual annealed fragments were able to 
multimerize, inserting into the parental plasmid more than one copy of the 
10 desired LexA sequence. The number of copies of inserted oligonucleotides 
was confirmed by DNA sequencing. 

E- pRS306/2xLexA o P /fflS3:TetR 

A DNA fragment containing two copies of LexA operator and 
the chimeric HIS3:TelR reporter was excised from 
1 5 pRS3 1 6/2xLexAop/HK3:TetR by digestion with Kpnl and BaniiU restriction 
enzymes. The fragment was gel-purified and subcloned into pRS306/Term 
previously digested with Kpril and BamUl and the resulting construct was 
sequenced to confirm the presence of two copies of the LexA operator. 

F. pRS30674xLexAop/HIS3:TetR 
20 and P RS306/RxI^xAon/HrS3 T ft tR 

A pair of oligonucleotides SHI01A and SH101B were utilized 

in PCR to amplify the LexA binding site multimer from the plasmid SH18- 

34ASpe [Hollenberg, S.M., etal., Mol.Cell.Biol. 15:3813-3822 (1995)]. 

SHI01A SEQIDNO:20 
25 5 ' -CCGG AATTCTCG AG AC ATATCC ATATCTAATC 

SH101B SEQIDNO:21 
5 ' -CCGG AATTCACTAATCGC ATTATC ATC 
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The amplification product containing four copies of LexA operator was gel- 
purified, digested with EcoKL, and subcloned into pRS306/HIS3:TetR/Term 
previously digested with EcoHl. The number of LexA operators were 
determined by DNA sequencing. 

S G. P RS306/8xI^x AoD/HIS3::TetR 

A PCR strategy was used to link the 5' promoter sequence of 
the yeast H1S3 gene encompassing nucleotides-75 to +23 to the translational 
stan of TetR. Sequences encoding the SV40 large T antigen nuclear local- 
ization signal were fused in frame with the nucleotide sequence encoding the 
10 last amino acid residue of TetR. The PCR product was digested with £coRI 
and BamHl and inserted into pRS306/Term previously digested with EcdSl 
and Bamm. The resulting plasmid was designated pRS306/fflS3:TetR/Term, 
and was shown to encode the complete TetR protein in frame with the nuclear 
localization signal of SV40 large T antigen. The fusion protein is followed 
by four amino acids generated by the vector backbone (Arg-Ile-His-Asp). 

The LexA binding site multimer from the plasmid pSH18- 
34ASpe [Hollenberg, S.M. etal.. Mol. Cell. Biol. 15:3813-3822 (1995)] was 
amplified by PCR, digested with EafBtl, and subcloned into the EcoRI site of 
pRS306/fflS3:TetR/Term resulting in plasmid pRS306/8xLexAop/TetR. 

H. DADH/TeiR 

The DNA coding sequence of TetR was amplified by PCR from 
pSLF104 using two oligonucleotides, NcoI-TetR and 3'-TetR terminal primer 
(SEQIDNO: 17). 



NcoI-TetR SEQ ID NO: 22 

5 '-C ATGCCATGGCCATGTCTAGATTAGATAAAAG 
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The resulting product was gel-purified, digested with Ncol and BamHl, and 
subcloncd into a pBTM116 (Bartel, a ai, in Cellular Interactions in 
Development: a Practical Approach Hartley (ed.), IRL Press; Oxford, pp. 
153-179 (1993)) shuttle vector containing an ADH promoter, previously 
5 digested with Ncol and BamHl. For construction of this vector, DNA 
generated by PCR and DNA obtained by restriction enzyme digestion of the 
polylinker region in plasmid pBluescript (Stratagene, La Jolla, California) 
were used to engineer additional restriction sites 5 ' and 3 ' of the ADH 
promoter. The TetR protein encoded from this construct is expressed 
1 0 containing additional amino acids Met^-Ala" 1 before the initiating methionine 
and also contains the nuclear localization signal of SV40 large T antigen 
located after the last amino acid of TetR as described above. 

I. DRS306/ADH:Te.tR/T>.rm 

A fragment encoding the ADH promoter and TetR was removed 
15 from plasmid pADH/TetR with Xhol and blunted-ended with the large 
fragment of DNA polymerase I (Gibco BLR, Grand Island. NY). EcoRl 
linkers (New England BioLabs, Beverly, MA) were added and the fragment 
was digested with EcoKl and BamHl. The resulting fragment was gel-purified 
and ligated into pRS306/Term previously digested with EcoHl and BamHl. 

20 J. pRS306/4xLexAop/ADH::TetR 

and pRS306/8xI^Anp/Ar>H- TWP 

The subcloning protocol used to insert multiple copies of the 
LexA operator into pRS306/ ADH :TetR/Term was the same as described 
previously for pRS 306/4 x Le x Aop/ HIS 3 : TetR and 
25 pRS306/8xLexAop/HIS3:TetR. 
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III. Plasmids Encoding Binding Proteins 



A. pLexA-CBD 

A DNA fragment containing the CREB binding domain of CBP 
(CBD), amino acids 461-682, was PCR amplified from plasmid CBP-0.8 
[Chrivia, J.C. et at., Nature 365:855-859 (1993)] using a pair of 
oligonucleotides designated 5' CBD primer and 3' CBD primer. 

5* CBD primer SEQ ID NO 23 

5'-GCGAATTCGCCAGGGCAACAGAATGCCACT 

3' CBD primer SEQ ID NO: 24 

5 ' CGGGATCCTGGCTGGTTACCC AGG ATGCCTTG 



Following gel purification, the amplification product was digested with EcoM 
and Bamm, and ligated into plasmid pBTMI16 [Bartel, ei ai, in Cellular 
Interactions in Development: a Practical Approach, (ed) Hartley, D.A. (IRL 
Press, Oxford), pp. 153-179 (1993)] previously digested with £coRI and 
BamlU. 



B. P VP16-CBD 

A DNA fragment encoding the CBP sequence was excised from 
pLexA-CBD by digestion with £coRI and Bamm. Plasmid pLexA-CBD was 
linearized with EcoM digestion, the resulting overhanging ends blunt-ended 
using the Klenow fragment of DNA polymerase I, and the ends ligated with 
BaniHl linkers. The resulting fragment was inserted into pVP16 [Hollenberg, 
etal., Mol. Cell. Biol. 15:3813-3822 (1995)) previously digested with into 
BamHl. 
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C. pVP16 CREB 

Plasmid pcDNA3/CREB283 [Sun and Maurer, J. Biol. Chem. 
270:7041-7044 (1995)], containing the VP16 transactivation domain fused to 
sequences of the rat CREB transactivation domain (1 to 283 aa) was linearized 
5 with Xhol and BamHI linkers (New England BioLab) ligated to the resulting 
blunt-ended Xhol sites. DNA encoding the VP16/CREB chimeric protein was 
removed with HindW and BamHI digestion and following gel purification, 
ligated into the Hindm and BatnHl sites of pVP16 which encodes the LEU2 
gene. 



10 D. pVPI 6-CREBfBgin-Sacm-LacZ 

A DNA fragment encoding #-galactosidase was PCR amplified 
from plasmid pSV-/3-galactosidase vector (Promega, Madison, WI) using a 
pair of oligonucleotides, 5 ' 0-gal primer and 3 ' 0-gal primer and inserted into 
the Noll site of pVP16 to produce pVP16-LacZ. 



15 5 ' 0-gal primer SEQ ID NO: 29 

5 ' -ATGGTACC AGCGGCCGCTAGTCGTTTTAC AACGTCGTGAC 

3 ' 0-gal primer SEQ ID NO: 30 

5 '-ATGGTACCGCGGCCGCTTATTTTTGACACCAGACCAAC 



A PCR fragment containing CREB sequences encoding amino acid residues 
20 1 to 283 was amplified from plasmid pRSV-CREB34I [Kwok, ei al. , Nature 
380: 642-646 (1996)] using a pair of oligonucleotides, 5 ' CREB 341 primer 
and 3 ' CREB 283 primer, and inserted into pVP16-LacZ vector at the BamHI 
site. 

5 ' CREB 341 primer SEQ ID NO: 25 

„ 25 5 -CGCGGATCCGGATGACCATGGACTCTGGAG 



3 ' CREB 283 primer SEQ ID NO: 28 

5 -CGCGGATCCGTGCTGCTTCTTCAGCAGGCTG 
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To generate a cassette vector for producing and subcloning mutated CREB 
sequences as described below, PCR was used to engineer a Bgia site using 
oligonucleotides 5' BgUI primer and 3' Bgia primer, at nucleotides 273 to 
278 and a SacU site using oligonucleotides 5' SacTL primer and 3' SacU 
primer at nucleotides 500 to 505 of the CREB activation domain. 



5 ' BglO. primer SEQ ID NO: 3 1 

5 -CGGAGATCTAAAGAGACTTTTCTCCGGAACTCAG 

3 ' Bgia primer SEQ ID NO: 32 

5 '-CGGAGATCTTTACAGGAAGACTGAACTGT 

5 Sacll primer SEQ ID NO- 33 

5 -CCACCGCGGCAGTGCCAACCCCGATTTAC 

3 ' SacU primer SEQ ID NO: 34 

3 -CATCCGCGGTGGTGATGGCAGGGGCTGA 

A DNA fragment containing the rat CREB transactivalion 
domain (amino acids 1 to 283) was excised from pcDNA/CREB283 [Sun and 
Maurer, supra] with Smal and XbaJ digestion. The 5 ' Xbal site was blunt 
ended with the large fragment of DNA polymerase I (Gibco BRL, Grand 
Island, NY) and Sail linkers (New England Biolabs. Beverly, MA) added. 
The fragment was digested with Sa/I and subcloned into the SaB. site of 
pBTMl 16. 



F. pLexA-CRFB 1A1 

A DNA fragment containing the rat CREB 341 cDNA was 
amplified by PCR from pcDNA/CREB34 1 [Kwok. supra) using a pair of 
25 oligonucleotides, 5 ' CREB 341 primer (SEQ ID NO: 25) and 3 ' CREB 341 
primer. 
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3 ' CREB 341 primer SEQ ID NO: 26 

5 -CGCGG ATCCTTAATCTG ACTTGTGGC AGTA 

After gel purification, the PCR product was digested with BamHI, and 
subcloned into the BamHI site of pBTMl 16. 

5 G. pLexA-CREB 141 -Ml 

A DNA fragment containing the rat CREB sequence with a 
mutation changing serine al position 133 to alanine was amplified by PCR 
from plasmid Rc/RSV CREB-M1 [Kwok, et al., supra) using the same set of 
primers as described for pLexA-CREB 341, 5 ' CREB 341 primer (SEQ ID 
10 NO: 25) and 3' CREB 341 primer (SEQ ID NO: 26). The resulting 
amplification product was gel-purified, digested with BamHl, and subcloned 
into the BamHI site of pBTMl 16. 

H. PVP16-CRER Ml 

A PCR fragment containing CREB sequences coding for amino 
15 acid residues I to 283 including the serine 133 mutation to alanine was 
amplified using a pair of oligonucleotides. 5 ' CREB 283 primer and 3 ' CREB 
283 primer (SEQ ID NO: 28). The PCR fragment was gel-purified, digested 
with BamHI and inserted into the BamHl site of pVP16. 

5 ' CREB 283 primer SEQ ID NO- 27 

20 5 -CGCGGATCCCCATGACCATGGAATCTGGAGCC 

I. DLexA-SRF 

A DNA fragment containing human SRF was excised from 
plasmid pCGN-SRF [Grueneberg, D.A., et al., Science, 257:1089-1095 
(1992)] with Xhol and BamHI digestion. The Xhol site of the fragment was 
!5 blunt-ended by the large fragment of DNA polymerase I (Gibco BRL, Grand 
Island, NY), ligated with BamHI linkers, digested with BamHI, and inserted 
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into pBTMl 16 previously digested with BamHl. 

J. pVP16-Tax 

A DNA sequence encoding full length Tax protein was excised 
from pS6424 [Kwok, R.P.S. , et al. , Nature 380:642-646 (1996)] with BamHl 
5 digestion and was inserted into pVP16 previously digested with BamHl. 

IV. Plasmids For Binding Protein Controls 

A. pLeu 

Plasmid pVP16 was digested with Hindm and BamHl to 
remove the fragment encoding the VP1 6 transactivation domain. The digested 
10 vector was blunt-ended and self-ligated. 

B. pLexA-VP16 

The VP16 transactivation domain was PCR amplified from 
pGaI-VP16 [Sadowski, et al., Namre 335:563-564 (1988)] with a pair of 
oligonucleotides, 5 -VP16SH and 3 VP16SH and the resulting amplification 
15 product was digested with Ool, blunt-ended, and inserted into pBTMl US. 



5-VP16SH 



SEQ ID NO: 35 



GGCTATCGATACGGCCCCCCCGACCGAT 



3-VP16SH 



SEQ ID NO: 36 



GCGTATCGATCTACCCACCGTACTCGTC 



20 



C. 



pLexA-Lamin 

See Hollenberg. S.M. et al., Mol.Cell.Biol. 15:3813-3822 



(1995)]. 
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V - Plasmids Encoding RepnrtPr Gene Cnntmk 

A. PRS306/Temi 

The alcohol dehydrogenase (ADH) terminator sequence was 
excised from plasmid pBTMl 16 [Barte!, « o/ ., j„ Cellular Interactions in 
Development: a Practical Approach, (ed) Hartley, D. A. (IRL Press, Oxford), 
pp. 153-179 (1993)] with Sphl and Pstl restriction enzymes and both 3'- 
overhanging sequences were blunted by T4 DNA polymerase (Gibco BLR, 
Grand Island. NY). The fragment was gel-purified and subcioned into the 
blunt-ended Noil site in pRS306 'Sikorski and Hieter, Genetics: 122: 19-27 
(1989)J. The orientation of inserted fragment was determined by DNA 
sequencing. 

B. PRS316/Te.rm 

The subcloning protocol for inserting the ADH terminator 
sequence into P RS3I6 was the same as described for inserting the ADH 
sequence in pRS306. 

Example 2 
Generation of Yeast Assay Transformant 
Selection of an appropriate yeast assay strain is an empirical 
determination based on growth characteristics of the transformed alternatives. 
A general method to make the appropriate selection is described as follows. 

Candidate yeast assay strains were transformed individually with 
reporter gene constructs and/or a plasmid encoding one of the experimental 
binding proteins. Assay strains thus transformed were then compared for 
relative differences in growth characteristics, with an optimal assay strain 
showing negligible growth on media lacking histidine and vigorous growth on 
media containing histidine. In practical application of this first step in selection 
using various plasmids transformed into assay strain YI584, the following 
results were observed. 
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When the plasmid pLexA-VPI6 encoding both the LexA DNA 
binding domain and the VP! 6 transactivating domain as a single protein was 
introduced into the assay cells, growth in the absence of histidine in the media 
was significantly reduced three days after transformation. 
5 In assays including transformation with plasmids encoding 

multiple copies of the tet operator upstream of the HIS3 gene, the following 
plasmids were separately utilized: 

pRS303/lxtetop-/WS (encoding a single tet operator sequence), 
pRS303/2xtetop -HIS (encoding two tet operator sequences). 
10 pRS303/3xtetop-///S (encoding three tet operator sequences), 
pRS303/4xtetop-///S (encoding four tet operator sequences), 
pRS303/6xtetop-///5 (encoding six tet operator sequences), 
pRS303/8xtetop-///S (encoding eight let operator sequences), or 
pRS303/10xtetop-7//S (encoding ten tet operator sequences). 

15 In the assay strains transformed with plasmids encoding either one, two, or 
three copies of the let operator upstream from the HIS3 gene, cells grew on 
media lacking histidine at a rate similar to cells grown on media containing 
histidine. In yeast assay strains transformed with plasmids encoding either 
six. eight, or ten copies of the tet operator upstream from the HIS3 gene, cell 

20 growth was low suggesting that these strains would not be useful in assays to 
examine binding and interruption of binding between test proteins. These 
results suggested that, in assay strains transformed with a reporter plasmid 
having more than three tet operator sequences upstream from the HIS3 gene, 
normal activity of the HIS3 promoter is disrupted and that these plasmids 

25 would not be useful. 

In assays wherein yeast cells were transformed with only 
reporter plasmids (and not plasmids encoding binding partner fusion proteins) 
encoding multiple copies of the LexA operator 5 ' of the TetR gene, the 
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following results were observed. Growth of assay cells transformed with 
plasmids bearing one, two, four, and eight copies of the regulatory LexA 
operator upstream of the TetR gene appeared to be "copy number" dependent. 
Yeast cells transformed with plasmids having two copies of the LexA operator 
5 grew at a rate significantly higher than those assay cell transformed with a 
plasmid bearing only one copy of the operator. Cells transformed with 
plasmids encoding either four or eight LexA operators upstream of the TetR 
gene grew at an approximately equal rate, and better than assay cells bearing 
a TetR gene driven by two copies of the operator. 
10 When the alcohol dehydrogenase (ADH) promoter was included 

upstream of the LexA operator (plasmids encoding either four or eight LexA 
operators) in the various reporter gene constructs, ceil viability was the 
lowest. 

The various cell lines constructed by the methods described 
15 above are shown in Table I, wherein various transformed yeast strains are 
identified (Strain It) along with the number of LexA operator sequences in the 
plasmid encoding TetR, the number of tetracycline operator sequences 
regulating expression of HIS3, and relative growth rate of the transformed 
strain on media containing histidine. It is important to note that growth 
20 variation of transformed cells in media containing histidine is observed, even 
in cell lines identically transformed. The number of " + " signs in Table 1 is 
indicative of the host cell's relative ability to grow on media lacking histidine 
in the absence of transformation with plasmids encoding potential binding 
proteins. Also in Table 1 , a subscript "a" is indicative of transformation with 
25 a plasmid bearing the alcohol dehydrogenase promoter; absence of a subscript 
"a" indicates use of the HIS3 promoter. 
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Table 1 
Various Yeast Transfnrmants 



Strain # Lex A TelOp Hi»- 
VI579 IX 2X + + h 
YI58I IX 2X + + ^ 



Y1580 


2X 2X 




YI582 


2x :x 




Diploids 


L40 




Strain a 


LexA TrfOp HU + 


YI583 


4X 2X 




YI585 


4X 2X 




YI5R7 


4X 2X 




YI589 


4X 2X 




YI584 


8X 2X 




YI586 


8X 2X 




YIS88 


8X 2X 






8X 2X 




Diploids L40 




Strain a 


LeiATelOp Hi> + 


YI50I 


2X 2X 






2X 2X 




YIS97 


:x 4X 




YI633 


2X 4X 




Y1636 


2X 4X 




YI600 


2X 6X 




YI606 


2X 6X 




YI630 


2X 6X 




Y1627 


2X 6X 




Y1603 


2X I0X 




Y1621 


2X I0X 




YI609 


2X I0X 




YI624 


2X 10X 




YIS93 


4X, 2X 




YI59S 


4X 2X 




Y1599 


4X a 4X 




YI634 


4X 4X 




YI63B 


«X„ 4X 





YI607 


4X 


6X 






YI628 


4X 








YI632 


4X = 












, I0X 






YI6I0 


4X° 


I0X 






YI622 


4X 


I0X 






YI62b 


*x. 


10X 






YI592 


8X 


2X 






YI596 


8X„ 


2X 






V159S 


8X 


4X 






Y163S 


* x * 


















YI60I 


8X 


6X 






YI608 










YI629 


8X° 








YI63I 










YI604 


8X 


I0X 






YI6II 


8X o 


I0X 






YI623 




10X 






YI625 




I0X 








Lc*A 


TetOp unin 




YI664 




JX 


w303(5O) 




YI666 


4X I 


IX 


w303(SI) 




YI668 


4X„ 


:x 


L40 (691 




Y1670 


4X 3 


2X 


L40 (70) 




YI66S 




3X 


wJ03(50) 




YI667 


8X° 


3X 


w303(5l) 




YI67I 


»X* 


3X 


L40 (69) 




YI669 


8X 3 


2X 


L40 (69) 




YI67I 




2X 


L40 (70) 




YI67I 


8X„ 


6X 


L40 (69) 
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Example 3 
CREB/CBP Binding Interaction 

Use of the split-hybrid assay for studies of protein/protein 
binding wherein one of the binding components is randomly mutagenized was 
carried out using CREB and CBP binding proteins. The binding of CREB to 
CBP has been shown to require the phosphorylation of the CREB serine 
residue at position 133 in a region designated the "kinase-inducible domain" 
(KID) [Chrivia, et al., Nature 365, 855-859 (1993); Kwok, et al.. Nature 
370, 223-226 (1994)]. Functionally, changing serine at position 133 to 
alanine (a mutant designated CREB-M1) abolishes the ability of CBP to 
activate CREB-mediated transcription. Preliminary studies have indicated that 
the CREB-M1 mutant in the split-hybrid system prevents the interaction with 
CBP and subsequent growth of the yeast assay strain on media lacking 
histidine. Precisely what other portions of the KID of CREB are required for 
binding to CBP is unknown, however. To define other potentially important 
amino acid residues, the KID (amino acid residues 102 to 160) of CREB 341 
was randomly mutagenized using PCR. 

A PCR Mutagenesis and C reation of Mutant Library 

The technique used for mutagenic PCR was a modification of 
that described by Uppaluri and TowJe [Mol. Cell. Biol. 15, 1499-1512 
(1995)]. The reaction mixture contained 20 ng of pVP16-CREB(BgUI-SacII)- 
LacZ. 16 mM (NH 4 ) 2 S0 4 , 67 m M Tris-HCI, pH 8.8, 6.1 mM MgCI 2 , 0.5 
mM MnCI 2 , 6.7 /*M EDTA. 10 mM 0-mercaptocthanol, 1 mM primers, ImM 
each dGTP, dTTP, and dCTP, 400 /iM dATP. and 2.5 units of Tag DNA 
polymerase (Promega, Madison, WI). After seven cycles of PCR (94°C for 
40 sec, 50"C for 40 sec, and 72°C for 40 sec), the PCR product was 
amplified a second time using the same primers and Vent DNA polymerase 
(New England BioLabs, Beverly, MA) under the same conditions for 25 
cycles. The resultant PCR product was gel purified, digested with Bgia and 
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SacTl, and inserted into the Bgia and SacU sites of pVP16-CREB(Bgin-SacH)- 
LacZ (constmction of which is described above). The resulting plasmids were 
transformed into DH5or bacterial cells. Transformarjts were pooled and 
plasmid DNA was isolated by CsCI gradient centrifugation. 

B. Construction and Use of p VPIfi-CRFR^gin-Sacni-I^rZ 

A DNA fragment encoding the 0-galactosidase gene was fused 
in frame to the carboxyl-terminal end of VP16-CREB as described above. 
The carboxy-terminal tag allowed identification of clones that contain frame- 
shift and nonsense mutations; colonies that remain positive for 0-galactosidase 
were presumed to contain an open reading frame throughout the mutated 
region. To facilitate the subcloning of mutated sequences, a cassette version 
of the CREB cDNA was generated that contained BgXl and a SacU sites 
flanking the 5 ' and 3 ' ends of the KID. respectively. These modifications 
altered the amino acid residue at position 168 from valine to alanine. The 
cDNA altered in this manner was indistinguishable from the original VP16- 
CREB and from VP16-CREB-LacZ when tested in the split hybrid assay. 
Primers complementary to regions flanking the KID were used in mutagenic 
PCR amplification reactions as described above under conditions which were 
optimized to achieve one to three mutations in the 177 bp region encoding the 
KID. PCR products were introduced into pVPI6-CREB(B^/n-Socn)-LacZ in 
place of wild-type sequence. A library of mutated sequences was transformed 
into yeast assay strain YI584 expressing LexA-CBD. Approximately 27 : 000 
yeast transformants were screened, yielding about 5,000 colonies that were 
capable of growing on selective media supplemented with 10 M g/ml of 
tetracycline and I mM of 3AT, determined as described below. 

Two screening steps were performed to eliminate uninformative 
mutations and false positives. First, filter 0-galactosidase assays were 
performed by standard methods [Vojtek, ei al., Cell 74:205-214 (1993)] on 
the 5,000 colonies which exhibited positive growth on media lacking 
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tryptophan, histidine, uracil, leucine, and lysine to eliminate expressed 
proteins having frame-shift and nonsense mutations. Five hundred thirty six 
colonies developed a dark blue color, whereas 412 colonies turned white and 
were presumed to express mutants containing either frame-shift or nonsense 
mutations. The other colonies developed a pale blue color, and control 
experiments suggested that these colonies may have expressed unstable lacZ 
fusion proteins. Pale blue colonies were not analyzed further. 

DNA from 536 dark blue colonies was isolated and transformed 
into E.coli MCI 066 cells. One hundred ninety three P VP16-CREB-(BglII- 
Sacnj-LacZ cDNAs were then isolated. 

In a second screening step, the 193 cDNAs were separately re- 
transformed along with pLexA-CBD into the split-hybrid strain as well as into 
the two-hybrid L40 strain [Vojtek. a al.. supra] in order to identify false 
positives and confirm that the mutant CREB proteins did not interact with 
CBP. Among the 193 cDNAs re-screened. 152 did not interact with CBP in 
the yeast two-hybrid system, 15 interacted weakly, and 26 interacted like wild 
type CREB. 

Following these two screening steps, the 152 CREB mutants 
were sequenced. Seventy CREB mutants were found to contain a single 
amino acid change. Sixty four CREB mutants contained two amino acid 
residue mutations and 13 mutants contained more than two amino acid 
mutations. Mutants containing more than one amino acid alteration were not 
analyzed further. The expression level of mutant proteins having one amino 
acid change were determined using a standard 0-galactosidase assay. 

The CREB mutations identified in the split-hybrid screen were 
shown to carry amino acid changes centered around the phosphorylation site 
at serine at position 133. No disrupting mutations were found to contain 
amino acid alterations outside of the region between amino acids 130 to 141. 
Most of the mutations abrogated the PKA phosphorylation region, but others 
were identified at isoleucine position 137, leucine at position 138, and leucine 
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at position 141. The mutations at positions 137, 138, and 141 generally 
changed the hydrophobic residues at these positions to polar residues. The 
abUity of the split-hybrid system to detect only a limited number of CREB 
mutants, many of which have been proposed previously to disrupt CREB 
association with CBP [Parker, et ai, Mol. Cell. Biol. 16, 694-703. (1996)], 
indicates the specificity of the split-hybrid system. 

These results lead to interesting suggestions. Various CREB 
mutations were identified which disrupt CREB-CBP interaction and the 
majority of disrupting mutations occurred in the CREB PKA phosphorylation 
motif. This result was consistent with previous observations that 
nonphosphorylated CREB and CBP do not interact [Kwok. et ai., Naiure 
370:223-226 (1994)]. The most common motif for PKA phosphorylation is 
an RRX(S/T)X amino acid sequence but RX(S/T)X and KRXX(S/T)X are also 
phosphorylated [Kemp and Pearson. T.I.B.S. 15, 342-346 (1990)]. The 
arginine residues in the phosphorylation site arc critical for electrostatic 
interactions with acidic amino acid residues in the catalytic subunit of PKA 
[Knighton, et ai. Science 253, 414-420 (1991)], and consistent with this 
observation, CREB mutants with changes at arginine residues 130 and 131 
were identified in the split hybrid assay that did not interact with CBP. 

Results also showed that CREB mutations at amino acids 
proline at residue 132 and tyrosine 134 were unable to bind CBP. It is likely 
that the mutations at these residues adversely affect the structure of the 
phosphorylation motif, although these positions are generally thought to be 
less critical to CBP binding. It is possible that the substitution of proline at 
position 132 with threonine created a new phosphorylation site (RXTX) that 
interfered with the critical phosphorylation of serine at position 133. 
Although not generally thought to be part of the "classical" consensus PKA 
phosphorylation motif, hydrophobic amino acids are commonly found 
carboxy-terminal to PKA sites [Kemp, et at., T.I.B.S. 19:440-444 (1994)]. 
The importance of these flanking residues may explain the frequent occurrence 
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of disrupting mutations involving tyrosine at position 134. Further studies 
will be directed to determining if mutations of proline at position 134 and 
tyrosine at position 134 directly disrupt phosphorylation of serine at position 
133 or disrupt binding of CREB to CBP by some other mechanism. 

In addition, substitution of serine at position 133 with threonine 
also prevented the interaction of CREB and CBP. PKA protein substrates 
containing a phosphorylatable threonine residue are known to exist in nature 
{i.e., protein phosphatase inhibitor 1 and myelin basic protein), although they 
are less common than those with phosphorylatable serines [Zetterqvist, etal, 
,n Peptides and Protein Phosphorylation , (ed.) Kemp, B.E. (CRC Press, Boca 
Raton, FL), pp. 172-187 (1990)], and synthetic peptides containing serine to 
threonine substitutions are relatively poor substrates for PKA phosphorylation 
[Zetterqvist, etal, supra). In the split-hybrid assay, however, it is unclear 
whether the mutation of threonine at position 133 disrupts the CREB-CBP 
interaction or if the mutant fails to become phosphorylated. Despite previous 
observations that serine residue at position 133 of mammalian CREB can be 
phosphorylated by a variety of protein kinases other than PKA, for example 
calcium/calmodulin-dependent protein kinase D and IV, protein kinase C, and 
a nerve growth factor (NGF)-activated CREB kinase [Sheng, et at.. Neuron 
4:571-582 (1990); Sheng, et al, Science 252:1427-1430 (1991); Xie and 
Rothstein, J. Immunol. .154:1717-1723 (1995); Ginty, et at.. Cell 77:1-20 
(1994)], it is not known which, if any, of these particular protein kinases are 
able to phosphorylate CREB at the serine at position 133 in yeast. The 
requirement for integrity of the entire RRXSX amino acid sequence, however, 
suggests that PKA is a reasonable candidate. 

The second category of mutations were identified adjacent the 
PKA phosphorylation motif. Amino acids isoleucine at position 137 and 
leucine at position 138 have previously been suggested to be important for 
hydrophobic interactions of CREB with CBP fParker, etal., Mol. Cell. Biol. 
16. 694-703 (1996)]. In this study, most of the mutations at position 137 and 
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138 converted these hydrophobic residues lo polar amino acids. Thus, another 
possibility for the failure of these mutants to bind to CBP is that changes at 
these positions affect protein folding. Similarly, the mutation at position 141 
substituted a polar residue for the wild-type hydrophobic leucine, and this 
5 mutation also has the potential to affect protein folding. 

Substitution of the isoleucine at position 137 with a hydrophobic 
phenylalanine residue was found to disrupt the interaction between CREB and 
CBP as well. This result could have been the result of a detrimental effec t on 
folding because of the steric hindrance associated with the comparatively 
10 larger size of phenylalanine. Alternatively, the proposed hydrophobic 
interactions between CREB and CBP are somewhat specific. Structural 
studies will be directed to definitively determine how these mutations affect 
binding. 

Perhaps most surprising was the finding that critical mutations 
15 were restricted to a small region in the KID sequence, even though the 
relatively low affinity of phosphorylated CREB and CBP, determined to be 
between 250 and 400 nM by fluorescence anisotropy measurements [Kwok, 
et at.. Nature 370, 223-226 (1994)], is consistent with a restricted protein 
binding domain. The capability of the split-hybrid system to screen for a 
20 limited number of CREB mutants suggests that the system is highly specific, 
and thus, should be useful to identify mutations which disrupt interacts 
between other pairs of binding proteins. 
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Example 4 
Tax/SRF Binding Interaction 

To further investigate the feasibility of using the split-hybrid 

system to study protein-protein interactions, a pair of well characterized 

interacting proteins, SRF and Tax, was tested. Previous studies indicated that 

SRF and Tax interact in a standard yeast two-hybrid system suggesting that 

the proteins may be utilized in the split hybrid assay. Plasmid pLexA-SRF, 

containing a human SRF cDNA fused to the LexA DNA binding domain, was 

transformed into strain YI584 along with either pVP16-Tax or pVP16 alone. 

As with the P LexA-VP16 transformation, the yeast strains co-expressing 

LexA-SRF and VP16-Tax failed to yield any colonies on medium lacking 

histidine. In contrast, when LexA-SRF was co-transformed with a vector 

encoding the VP1 6 activation domain alone, yeast growth occurred on medium 

lacking histidine, suggesting that TetR expression was not activated. These 

results demonstrated that a protein-protein interaction in the split-hybrid 

system can effectively prevent yeast growth and further indicated the utility 

of the assay for the study of various protein/protein interactions. 

Example 5 
Casein Kinase Binding Assays 

Hn-25 

In another example of use of the split hybrid assay to examine 
protein/protein interactions, Hrr25, a yeast casein kinase isoform, or human 
casein kinase I isoform 5, was employed in the assay with a known binding 
partner protein. 

Previous work using the two hybrid assay had identified three 
genes encoding proteins which interact with the yeast casein kinase isoform 
Hrr25. Proteins encoded by the genes were designated TTH1, TTH2, and 
TTH3. The Hrr25 expression construct which was generated for use in the 
two hybrid assay was used in combination with the individual TTH encoding 
constructs in the split hybrid assay to determine if interaction between the 
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binding partners would decrease growth of assay yeast cells on media lacking 
histidinc. Construction of the Hrr25 expression plasmid and isolation of 
plasinids encoding TTH proteins is discussed below. 

In order to identify genes encoding proteins that interact with 

5 S. cerevisiae HRR25 CKI protein kinase, a plasmid library encoding fusions 
between the yeast GAL4 activation domain and S. cerevisiae genomic 
fragments ("prey" components) was screened for interaction with a DNA 
binding domain hybrid that contained the E. coli lexA gene fused to HRR25 
("bait" component). The fusions were constructed in plasmid pBTM116 

10 which contains the yeast TRP1 gene, a 2/x origin of replication, and a yeast 
ADHI promoter driving expression of the E. coli lexA protein containing a 
DNA binding domain (amino acids I to 202). 

Plasmid pBTM 1 1 6: .HRR25 encoding the lexA::HRR25 fusion 
protein was constructed in several steps. The DNA sequence encoding the 

15 initiating methionine and second amino acid of HRR25 was changed to a Sma\ 
restriction site by site-directed mutagenesis using a MutaGene mutagenesis kit 
from BioRad (Richmond. California). The DNA sequence of HRR25 is set 
out in SEQ ID NO: 39. The oligonucleotide used for the mutagenesis is set 
forth below, wherein the Smal site is underlined. 

20 5'-CCTACTCrTAGGCjCC£GGTCTTTTTAATGTATCC-3' 

(SEQ ID NO: 37) 

After digestion with Smal, the resulting altered HRR25 gene was ligated into 
plasmid pBTM116 at the Smal site to create the lexA::HRR25 fusion 
construct. 

25 Interactions between bait and prey fusion proteins were detected 

in yeast reporter strain CTY10-5d (genotype =MA Ta ade2 trpl-901 leu2- 
3,112 his 3-200 gal4 gal80 URA3::lexA op-lacZ.) [Luban, et at.. Cell 
73: 1067-1078 (1993)] carrying a lex A binding site that directs transcription of 
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20 



lacZ. Strain CTY10-5d was first transformed with plasmid 
pBTMH6::HRR25 by lithium acetate-mediated transformation flto, a al., 
J. Bacterial. 153: 163-168 (1983)]. The resulting transformants were then 
transformed with a prey yeast genomic library prepared as GAL4 fusions in 
the plasmid pGAD [Chien. et al.. Proc.Natl.Acad.Sci (USA) 27:9578-9582 
(1991)] in order to screen the expressed proteins from the library for 
interaction with HRR25. A total of 500,000 double transformants were 
assayed for 0-gaIactosidase expression by replica plating onto nitrocellulose 
filters, lysing the replicated colonies by quick-freezing the filters in liquid 
nitrogen, and incubating the lyscd colonies with the blue chromogenic 
substrate5-bromo-4-chloro-3-indolyl-/3-D-galactosidefX-gal).0-galactosidase 
activity was measured using Z buffer (0.06 M Na 2 HP0 4 , 0.04 M NaH 2 P0 4 , 
0.01 M KCI. 0.001 M MgS0 4? 0.05 M /3-mercaptoethanol) containing X-gal 
at a concentration of 0.002% [Guarente, Meth. Enzymol. 707:181-191 (1983)]. 
Reactions were terminated by floating the filters on 1M Na 2 C0 3 and positive 
colonies were identified by their dark blue color. 

Library fusion plasmids (prey constructs) that conferred blue 
color to the reporter strain co-dependent upon the presence of the 
HRR25/DNA binding domain fusion protein partner (bait construct) were 
identified. The sequence adjacent to the fusion site in each library plasmid 
was determined by extending DNA sequence from the GAL4 region. The 
sequencing primer utilized is set forth below. 



5 ' -GG AATC ACTAC AGGG ATG-3 ' (SEQ ID NO: 38 ) 

DNA sequence was obtained using a Sequenase version n kit (US 
Biochemicals, Cleveland, Ohio) or by automated DNA sequencing with an 
ABI373A sequencer (Applied Biosystems, Foster City, California). 

Four library clones were identified and the proteins they encoded are 
designated herein as TIH proteins 1 through 4 for Targets Interacting with 
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HRR25-like protein kinase isoforms. The TEH1 portion of the TIH1 clone 
insert corresponds to nucleotides 1528 to 2580 of SEQ ID NO: 40; the TTH2 
portion of the TIH2 clone insert corresponds to nucleotides 261 1 to 4053 of 
SEQ ID NO: 41; and the TIH3 portion of the TTH3 clone insert corresponds 
5 to nucleotides 248 to 696 of SEQ ID NO: 42. Based on DNA sequence 
analysis of the TTH genes, it was determined that TEH I and TTH3 were novel 
sequences that were not representative of any protein motif present in the 
GenBank database (July 8, 1993). TTH2 sequences were identified in the 
database as similar to a yeast open reading frame having no identified 

10 function. (GenBank Accession No. Z2326I, open reading frame YBL0506) 
When the various TTH proteins were used in the split hybrid 
assay in combination with Hrr25, it was observed that Hrr25/TIH3 binding, 
previously determined to be weaker than Hrr25/TTH2 or Hrr25/TIH1 
interactions, produced the lowest level of growth in the transformed yeast 

15 strain. 

CKI5 

In order to isolate cDNAs which encode proteins that interact 
with CKI6, the two hybrid assay was performed using a LexA-CKIS fusion 
protein as bait. The coding region of CKI6 was subcloned into a BamHl site 

20 of pBTMl 16 and transformed into a yeast strain designated CKI6/L40 (MAT 
a his3 A200 trp 1 -90 1 leu2-3 1 1 2 ade2 LYS : : (lex Aop) 4 fflS3 URA3 : : (lexAop) 8 - 
IcZ GAL 4). CKI6/L40 was subjected to a large scale transformation with a 
cDNA library made from mouse embryos staged at days 9.5 and 10.5. 
Approximately 40 million Iransformams were obtained. Eighty-eight million 

25 were plated onto selective media lacking leucine, tryptophan and histidine. 
The ability of yeast transformants to grow in the absence of histidine 
suggested that there was an interaction between CKI6 and some library 
protein. 

In a second screening, interaction of the two proteins was 
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assayed by the ability of the interaction to activate transcription of 0- 
galactosidasc. Colonies that turned blue in the presence of X-gal were 
streaked onto media lacking leucine, tryptophan and histidine, grown up in 
liquid culture and pooled for isolation of total DNA. Isolated DNA was used 
5 to transform E. coli strain 600 which lacks the ability to grow on media 
lacking leucine. Colonies that grew were used for plasmid preparation and 
three classes of cDNA were identified. One class was closely related to a 
Drosopliila transcription factor dCREBa. 

When CKK/CREB interaction was examined in the split hybrid 
10 assay, cells were shown to grow on media containing histidine, but in the 
absence of histidine, growth was inhibited. Addition of small amounts of 
tetracycline to the cell culture restored the cell's ability to grow, suggesting 
that the interaction between CKI6 and CREBa was very weak. 

Example 6 

1 5 AKAP 79 Binding Assays 

Expression Plasmid UHlmvt 

In still another example of use of the split hybrid assay to 
examine protein/protein interactions, an anchoring protein for the cAMP 
dependent protein kinase, AKAP 79, was utilized separately with binding 

20 partner proteins including the cAMP protein kinase regulatory subunit type I 
(RI), the cAMP dependent protein kinase regulatory subunit type II (RE) or 
calcineurin (CaN). Plasmids used in the assay were constructed as described 
below. 

A 1.3 kb NcollBamm fragment containing the coding region 
25 of AKAP 79 was isolated from a pETl Id backbone and ligated into plasmid 
pASl. Plasmid pASl is a 2 micron based plasmid with an ADH promoter 
linked to the Gal4 DNA binding subunit [amino acids 1-147 as described in 
Keegan et al. , Science, 231 :699-704 (1986)], followed by a hemagglutin (HA) 
tag, polyclonal site and an ADH terminator. The expressed protein was 
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therefore a fusion between AKAP 79 and the DNA binding domain of Gal4. 

Plasmids encoding RI, RD or CaN were isolated from a pACT 
murine T cell library in a standard two hybrid assay using the AKAP 79 
expression construct described above. Plasmid pACT is a Ieu2, 2 micron 
5 based plasmid containing an ADH promoter and terminator with the Gal4 
transcription activation domain n [amino acids 768-881 as described in Ma 
and Ptashne, Cell, 48:847-853 (1987)], followed by a multiple cloning site. 
RI, RH and CaN encoding plasmids were isolated as described below. 

A 500 ml SC-Trp yeast cell culture (OD^ = 0.6-0.8) was 

10 harvested, washed with 100 ml distilled water, and repelleted. The pellet was 
brought up in 50 ml LiSORB (100 mM lithium acetate, 10 mM Tris pH8, 1 
mM EDTA pH8, and 1 M Sorbitol), transferred to a I liter flask and shaken 
at 220 rpm during an incubation of 30 minutes at 30°C. The cells were 
pelleted, resuspended in 625 nl LiSORB, and held on ice while preparing the 

15 DNA. 

The DNA was prepared for transformation by boiling 400 10 
mg/ml salmon sperm DNA for 10 minutes after which 500 til LiSORB was 
added and the solution allowed to slowly cool to room temperature. DNA 
from a Mu T cell library was added (40-50 jig) from a 1 mg/ml stock. The 

20 iced yeast cell culture was dispensed into 10 Eppendorf tubes with 120 /tl of 
prepared DNA. The tubes were incubated at 30°C with shaking at 220 RPM. 
After 30 minutes, 900 id of 40% PEG 3350 in 100 mM Li acetate, 10 mM 
Tris. pH 8, and 1 mM EDTA, pH 8, was mixed with each culture and 
incubation continued for an additional 30 minutes. The samples were pooled 

25 and a small aliquot (5 /d) was removed to test for transformation efficiency 
and plated on SC-Leu-Trp plates. The remainder of the cells were added to 
100 ml SC-Leu-Trp-His media and grown for one hour at 30°C with shaking 
at 220 RPMS. Harvested cells were resuspended in 5.5 ml SC-Leu-Trp-His 
containing 50 mM 3AT (3-amino triazole) media and 300 id aliquots plated 

30 on 150 mm SC-Leu-Trp-His also containing 50mM 3 AT. Cell were left to 
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grow for one week at 30°C. 

After four days, titer plates were counted and 1. lxl 0 5 colonies 
were screened, large scale 0-gal assays were performed on library plates and 
ten positive clones were isolated for single colonies. One of these colonies 
grew substantially larger than the rest, and was termed clone II. 1. Sequence 
from clone 11.1 revealed an open reading frame 487 aa long which was 
correctly fused to the Gal-4 activation domain of pACT. The NIH sequence 
database was searched and the sequence was found to be closely homologous 
to the human calmodulin dependent protein phosphatase, calcineurin. 

Additional screening using pACT Mu T-cell library DNA and 
the pASI AKAP 79 bait strain was performed in order to identify other AKAP 
79 binding proteins by the protocol described above. Results from screening 
approximately 21 1 ,000 colonies gave one positive clone designated pACT 2-1 . 
Sequencing and a subsequent data base search indicated that the clone had 
91 % identity with rat type la regulatory subunit of protein kinase A (RI). 

The library was rescreened using the same AKAP 79 bait and 
fifteen positives were detected from approximately 520,000 transformants. Of 
these fifteen, eleven were found to be homologous to the rat regulatory 
subunit type I of PKA. Each of these isolates were fused to the 5' 
untranslated region of RI and remained open through the initiating methionine. 

Split Hvhrid Analy^s 

In split hybrid analysis of AKAP79 binding interactions, a 
plasmid was first constructed for expression of a LexA:AKAP 79 fusion 
protein. An AKAP 79 coding region was excised from pAS AKAP 79 as an 
NcoVBamm fragment and inserted into pBTMl 1 6 previously digested with the 
same enzymes. The resulting plasmid was designated pBTMl 16-AKAP79. 

Approximately 50,000 W303 yeast cells (strain YI665, see 
Table 1) in logarithmic growth were rinsed in media lacking histidine, 
suspended in 100 „! to 200 pi of the same media, and plated on agar lacking 
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histidine (to select for absence of protein/protein interaction) and also lacking 
leucine and tryptophan (to select for transformants bearing expression 
constructs encoding AKAP 79 and its binding partner). When RE was 
employed as the AKAP 79 binding partner, 2 to 4 M M tetracycline and 5 mM 
5 3AT were required to prevent the transformed host from growing under 
conditions where the expressed proteins interacted. 

Once conditions were established under which growth of the 
transformed host was eliminated, various candidate inhibitor compounds were 
separately added to the agar. It was presumed that if one of the candidate 

10 compounds was capable of disrupting AKAP 79 interaction with the binding 
partner protein, growth of the transformed host should be detectable in the 
vicinity of the compound on the agar. In the split hybrid assay wherein 
AKAP 79 and RII binding was examined, 2/d of a 30 mM stock solution of 
ICOS Compound 4273 in DMSO, 2 ftl of a 10 mM stock solution of ICOS 

15 Compound 1062 in DMSO, and 2 M l DMSO alone (as a negative control) 
were spotted on to the plate which was incubated at 30°C for four to five 
days. For ICOS Compound 4273 a ring of growth was detected. 

In order to determine an ICjq for an inhibitor identified as 
described above, alternative methods may be used. In one method, the 

20 inhibitor compound is added to the agar over a range of concentrations. 
Ideally, the compound is diluted to the point that host cell growth is 
essentially not detectable. 

In another method, a 96 well plate is used and the compounds 
of interest are serially diluted across one row of a 96 well plate, one 

25 compound per row. Media lacking histidine, tryptophan, and leucine is added 
(presuming that the expression plasmids encoding the binding partners also 
encode trp and leu proteins) along with the appropriately transformed host 
yeast strain. Tetracycline and 3AT are added at concentration previously 
determined to extinguish growth of the transformed host cell. After two to 

30 Five days incubation at 30°C, the plate wells are read at approximately 600 
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20 



nm using a plate reader. The concentration of inhibitor half way between zero 
and the lowest concentration that permits growth of the host cell to the level 
observed on media containing histidine is estimated to be IC 50 . 

A modification of this second method is particularly amenable 
for use in a high throughput screen of large numbers of candidate inhibitors. 
For example, rather than attempting to determine the IC 50 for a previously 
identified inhibitor, separate candidate inhibitors are added to each well of a 
96 well plate, preferably at more than one concentration, and host cell growth 
determined after several days incubation. Inhibitory activity of compounds 
identified in this manner is confirmed on an agar plate and the IC 50 
determined on 96 well plates, each assay as described above. 

Example 7 

General Application of The Split-Hybrid Screen 
In order to examine general utility of the split hybrid system, 
various experiments were conducted with binding proteins known to interact. 
In addition, a number of control experiments were included in order to 
determine if the effects observed with the known binding partners were in fact 
due to protein/protein interaction. 

A - Yeast Assay .Strain Construction 

Yeast transformants used in assays indicated below were 
derived from LYS2-deficient strains AMR69 (Mat a hisS lys2 leul trpl, 
URA3:LexA::LacZ) and AMR70 (Ma. a his3 lys2 trpl Ieu2, 
URA3:LexA::LacZ) [HoHenberg, et al.. Mol. Cell. Biol. 15, 3813-3822 
(1995); Chien, a al., Proc. Natl. Acad. Sci. (USA) 88:97578-9582 (1991); 
Fields and Song, Nature 340:245-246 (1989)]. Yeast were grown in YEPD 
or selective minimal medium using standard conditions [Sherman, F., et al., 
Methods in Yeast Genetigj , Cold Spring Harbor Lab., Cold Spring Harbor, 
NY (1986): Methods in Enzymology, Vol. 194 Guide to Yeast Genetics and 
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Molecular Biology. Eds. Christine and Fink]. Derivatives of both AMR69 
and AMR70 strains lacking URA3 were first generated by streaking cells on 
synthetic media containing 5 mg/ml 5-fiuoro-orotic acid (5FOA) [Methods in 
Enzymology, Vol. 194 Guide to Yeast Genetics and Molecular Biology. Eds. 
Christine and FinkJ. Two URA3 deficient mutants were required due to the 
fact that these strains were subsequently mated. URA3-deficient colonies 
were confirmed by testing for uracil auxotrophy and deletion of the 
URA:LexA::LacZ locus was confirmed by an absence of 0-galactosidase 
activity assayed by standard methods. The mutant strains selected were 
designated 69-4 and 70-1. 

Targeted integration of pRS306/8xLex Aop/TetR was carried out 
by transforming [Hollenberg, etal., Mol. Cell. Biol. 15, 3813-3822 (1995)] 
the 69-4 strain with plasmid linearized at a unique Ncol site. The reporter 
gene construct was constructed using parental plasmid pRS306 which encodes 
URA3 as a selectable marker. Stably integrated plasmid thereby permitted 
selection on media lacking uracil. The positive uracil prototrophic strains 
were examined by Southern analysis to confirm insertion of the plasmid 
sequences. 

Targeted integration of pRS303/2xtetop-LYS was carried out 
by transformation [Hollenberg, et aL, supra] of strain 70-1 with plasmid 
linearized at a unique Hpa\ site. The resulting lysine prototrophic strains 
were examined by Southern analysis to confirm insertion of the plasmid DNA. 

The AMR69 derivative strain (MAT a) containing the 
pRS303/2xtetop-LYS insertion was mated with the AMR70-derivative strain 
(MAT a) containing P RS306/8xLexAop/TetR and mated cells were selected 
on media lacking both lysine and uracil. Single colonies were grown u P and 
tested for the ability to grow on media lacking histidine. The resulting strain 
was designated YI584. In instances where yeast strains were transformed with 
other reporter gene pair combinations, the strains were uniquely designated. 

Yeast bearing integrated reporter gene constructs were 
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subsequently transformed [Hollenberg, et a!., supra] with plasmids encoding 
chimeric binding protein. Plasmids encoding the LexA DNA binding region 
were generally derived from parental plasmid pBTM116 which also encodes 
TRP1 as a selectable marker. Plasmids encoding the VP16 transactivating 
5 domain were generally derived from parental plasmid pVP16 which also 
encodes LEU2 as a selectable marker. Yeast cells which were successfully 
transformed with the four exogenous plasmids were therefore selected by an 
ability to grow on media lacking lysine, uracil, tryptophan, and leucine. 
Plasmids encoding various binding proteins were transformed into the yeast 
10 assay strain as indicated below. 



B. Liquid Assay 

After three days growth at 30°C on selection media as 
described above, a pool of colonies from each transformation was collected 
and diluted in 5 ml selective media. The mixture was vortexed and 
immediately sonicated for ten seconds. Cells in the resulting suspension were 
counted and seeded at 1000 cells/ml in selective media, 2 ml per 15 ml tube. 
Tetracycline, 3AT. and histidine were included as determined appropriate by 
the method described above. Each aliquot of cells was incubated with shaking 
for two days at 30°C and cell density measured at OD 600 . 

C C haracterization of the A™ y 

The utility of the split-hybrid assay was first determined using 
well characterized binding proteins and various controls. 

In an initial study, YI584 cells were transformed with plasmids 
P LexA-VP16 and pLeu. While the expressed proteins from the two plasmids 
do not interact, P LexA-VPI6 encodes a fusion protein containing the VP16 
activation domain fused directly to LexA which contains a DNA binding 
domain. The chimeric LexA-VP16 protein is a strong transactivator for a 
promoter containing LexA operators. Plasmid pLeu is essentially a blank used 
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as a control co-transformation plasmid. 

Yeast transformed with the LexA-VP16 plasmid were able to 
express TetR protein as indicated by gel shift analysis using a let operator 
oligonucleotide. In addition, the cells were unable to grow on media in the 
absence of histidine. Combined, these observations suggested that 
overexpressed TetR protein was capable of binding to let operators and 
preventing the expression of HIS3. The transformed yeast grew on plates 
containing histidine, further indicating that overexpression of TetR did not 
have a toxic effect on the assay cells. 

The results were consistent with previous observations and 
supported the earlier suggestion that activation of TetR expression, either 
through a single transcription factor or association of individual transcription 
factor domains, is capable of preventing assay cell growth on media lacking 
histidine, presumably by eliminating HIS3 production. 

Example 8 

Split-Hybrid Assay With Weakly Interacting Binding Proteins 

Protein/protein interaction was examined in the split-hybrid 
assay to determine utility of the system using two fusion proteins known to 
interact weakly. In this instance, the binding proteins were a 283 amino acid 
fragment of a cAMP regulatory binding protein (CREB283) fused to LexA 
and a fragment of the CREB binding protein consisting of the CREB binding 
domain (CBD) fused to VP16. 

In this assay, yeast strain YI584 described above was employed 
and transformation carried out as previously described. In a first assay, 
plasmids pLexA-CREB and pVP16-CBD were transformed into the cells and 
cell growth was observed in the absence of histidine in the media. Expression 
of the fusion proteins was confirmed by Western blotting. Attempts to 
decrease cell growth by titration with 3AT were unsuccessful in that the 
concentration of 3AT required to reduce growth in cells transformed with 
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pLexA-CREB and pVPl6-CBD also eliminated growth in cells transformed 
with pLexA-CREB and the control plasmid pVP!6. 

In light of these results, two alternative approaches were taken 
in order to permit study of binding proteins wherein the interaction is 
5 relatively weak . Under the assumption that the system was failing at the level 
of TetR transcription, alternative approaches were taken in attempts to amplify 
the TetR effect on expression of HIS3 gene. To achieve this end, assay cells 
were transformed with reporter constructs which encoded multiple let operator 
sequences upstream from the HIS3 gene. In the second approach, the H1S3 
10 promoter used to drive expression of the TetR gene was replaced with the 
stronger alcohol dehydrogenase (ADH) promoter. 

In YI596 cells wherein the ADH promoter replaced the HIS3 
promoter to drive TetR expression, transformation with plasmids pLexA- 
CREB and pVP16-CBD showed substantially decreased growth on his' media 
15 as compared to that in assay strain YI592 wherein the HIS3 promoter was 
used to drive TetR expression. However, in cells transformed with plasmids 
pLexA-CREB 341 -Ml and pVP16-CBD, no decrease in assay cell growth was 
detected on media lacking histidine. These results indicate that incorporation 
of the ADH promoter to drive TetR expression may be more useful in studies 
20 involving binding proteins that have low affinity. 

When assay strains were utilized which incorporated plasmids 
wherein expression of the HIS3 gene was driven by multiple copies of the tet 
operator, transformed cell lines did not grow well enough to indicate potential 
utility in subsequent assays. 



WO 98/13502 



PCT/US97/17276 



Example 9 
General Assay Methods 

A. "Fine Tuninp" 

In instances where either of the test fusion proteins possesses 
intrinsic capacity for transcriptional activation, TetR will be expressed and 
growth of the assay strain media lacking histidinc will be depressed 
proportional to the level of TetR expression. In order to restore growth of 
these cells to approximately the level observed on media containing histidine, 
the initially transformed assay yeast strains arc grown in the presence of 
increasing concentrations of tetracycline which binds to the TetR gene product 
and prevents TetR binding to the tet operator. Precise titration of expressed 
TetR with tetracycline, only to the point that growth of the assay strain is 
restored to the level detected in the presence of histidine, permits detection of 
subsequent decreased growth of the assay strain following increased TetR 
expression resulting from interaction of the test binding proteins. The 
empirically determined tetracycline concentration is therefore employed to 
increase "signal-to-noise" ratios under assay conditions. 

After an appropriate tetracycline concentration has been 
determined for each of the candidate assay strains, the cells arc transformed 
with the second plasmid encoding the second fusion binding protein. As 
before, growth of each candidate assay strain is examined on media in the 
presence and absence of histidine. A desirable yeast assay strain is chosen 
which shows vigorous growth in the presence of histidine and negligible 
growth on media lacking histidine (indicative of the expected protein/protein 
interaction and resultant decreased expression of HIS3). 

In instances where binding between the two test proteins is 
comparatively weak, TetR expression may not be sufficiently increased to 
abolish HISS expression and cells expressing the resultant low levels of H1S3 
will still grow on media which lacks histidine. Cells which show this low 
level of viability are grown in the presence of increasing concentrations of 3- 
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aminotriazole (3 AT), a competitive inhibitor in the histidine synthesis 
pathway, in order to reduce cell growth to negligible levels when plated on 
media lacking histidine. As with titration of TetR with tetracycline, addition 
of 3AT to the media is designed to increase the signal-to-noise ratio by 
providing significant changes in growth in the presence and absence of 
histidine in the media. 

In a practical application of the methods for fine tuning, binding 
between CREB and the CREB binding protein (CBP) is illustrative. Growth 
of the yeast strain YI584 transformed with pLexA-CBD, encoding the CREB 
binding domain (CBD) of CBP. and P VP16-CREB or pLexA-CBD and the 
control plasmid pVPI6 was substantially decreased and virtually 
indistinguishable growth rates were detected in both instances on media 
lacking histidine. This observation indicated that the LexA-CBD protein 
product possessed sufficient transacting capacity to eliminate fflS3 
production. In order to distinguish growth differences between assay cells 
transformed with either pVP16 and pVP16-CREB, increasing amounts of 
tetracycline were added to the media lacking histidine. 

In both transformants, tetracycline was able to relieve growth 
repression in a dose dependent manner, and at increasing concentrations of 
tetracycline, the difference in growth between the two colonies was 
increasingly magnified, with the most distinct growth difference observed 
following addition of tetracycline at 10 M g/ml. Addition of tetracycline was 
therefore able to overcome the intrinsic transacting capability of the LexA- 
CBD fusion protein. 



: the ultimate use of the split-hybrid system is for 
structure-function studies, mutagenesis studies, drug identification and library 
screens, it is important to minimize background growth that might be confused 
with disrupted protein-protein associations. This can be accomplished by the 
addition of 3AT, a competitive inhibitor of the HIS3 gene product For 
insunce, i„ the presence of 10 „g/ml of tetracycline, the yeast strain 
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transformed with pLexA-CBD and pVPI 6-CREB still conferred approximately 
12% growth of thai observed in the presence of his + media. To diminish this 
background, increasing concentrations of 3AT were added to the media in the 
presence of 10 M g/ml of tetracycline. At the 3AT concentration of 0.25 mM, 
the growth of the yeast strain expressing LexA-CBD and VPI 6-CREB was 
below 5%. while the growth of the control strain was still maintained at 70% 
of control levels. These results indicate that split-hybrid system can be 
modulated by 3AT in addition to tetracycline in order to effectively increase 
the signal-to-noise ratio. 



10 B- Preparation of yeast extract 

In order to assess the utility of various plasmids to function in 
the split-hybrid assay, a number of control experiments can be employed 
which lend insight into expression of a desired protein from the transformed 
plasmid. For example, standard immunological methodologies, i.e., 
15 immunoprecipitation, ELJSA, etc., can be used to determine to the extent to 
which a desired protein is expressed. Similarly, a variation of the gel shift 
assay (discussed immediately hereafter) can be used to determine both if a 
protein is expressed and if the expressed protein is capable of DNA binding. 
In each of these control assays, a yeast extract is required which can be 
20 prepared as follows. 

Extracts were prepared as described by Uppaiuri and Towle 
[Mol. Cell. Biol. 15:1499-1512 (1995)] and were used for electrophoretic 
mobility shift assays as discussed below. The yeast cells transformed with 
P LexA-VP16 were grown in 100 ml of selective synthetic medium lacking 
25 uracil, tryptophan, and lysine to a density of A^ = 1 . Cells were harvested 
and washed with 5 ml of EB (containing 0.2 M Tris-HCI, pH 8.0, 400 mM 
(NH 4 ) 2 S0 4 , 10 mM MgCI 2 , 1 mM EDTA, 10% glycerol, and 7 mM 0- 
mercaptoethanol). Cells were transferred to microcentrifuge tubes and 
collected by centrifugation. After resuspending in 200 /d EB containing 1 
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mM phenylmethylsulfonyl nuoride (PMSF), I^g/ml leupeptin, and l^g/ml 
pepstatin, a one half volume of glass beads was added. The suspension was 
frozen in a -80°C freezer for 1 hour and thawed on ice. Thawed cells were 
vortexed at 4°C for 20 minutes, after which an additional 100 M l EB was 
added, and cells were left on ice for 30 minutes. The suspension was 
centrifuged for 5 minutes, the supernatant was transferred to a new tube which 
was centrifuged for 1 hour in a microcentrifuge. The supernatant was then 
made to 40% with (NH^SO* and gently rocked for 30 minutes. After a 10 
minute centrifugation, the pellet was resuspended in 300 h \ of 10 mM 
HEPES, pH 8.0, 5 mM EDTA, 7 mM /3-mercaptoethanol, I mM PMSF, 1 
Mg/ml leupeptin, and 1 M g/n,l pepstatin. and 20% glycerol. The resulting 
suspension was dialyzed against the same buffer, and aliquots were stored at - 



?0°C. 



C Electrophnr rtic mohilir Y <=hift ^» Y o 

Electrophoretic mobility shift assays were performed as described by 
Shih and Towle [J. Biol. Chem. 267:13222-13228 (1992)]. Double-stranded 
let operator oligonucleotides were prepared by combining equivalent amounts 
of complementary single-stranded DNA (SEQ ID NOS: 7 and 8) in a solution 
containing 50 mM Tris-HCI, pH 8.0, 10 mM MgCI 2 , and 50 mM NaCI 2 , 
heating the mixture to 70'C for 10 minutes, and then cooling to room 
temperature. The annealed oligonucleotides were labeled by filling in 
overhanging 5 ' ends using the Klenow fragment of E. coli DNA polymerase 
I with [«- 32 P]dCTP. Binding reactions were carried out in 20 M l containing 
10 mM Tris-HCI, pH 7.5, 50 mM NaCl, 1 mM EDTA, 1 mM dithiothreitol, 
5% glycerol, and 2 mg of poly[d(I C)]. A typical reaction contained 20,000 
cpm (0.5-1 ng) of end-labeled DNA with 3-5 M g of yeast extract. Following 
incubation at 22°C for 30 minutes, samples were separated on a 4.5% 
nondenaturing polyacrylamide gel containing 50 mM Tris, 384 mM glycine, 
and 2 mM EDTA, P H 8.3. For competition binding experiments, the 
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conditions were exactly as above except that specific and nonspecific 
competitor DNAs were included in the binding mixture before the yeast 
extract was added. The concentration of tetracycline, a competitive inhibitor 
of TetlUtei operator binding, was 1 pM when utilized. 



Example 10 

Application of the Split-Hybrid Assay to Identify Agents 
That Prevent Receptor Desensitization and Drug Tachyphylaxis 

Over half of the drugs that are used clinically affect the function 
of seven transmembrane receptors. Although many of the characteristics of 
these receptors are distinct, two general features appear to be conserved. One 
is the ability to signal through dissociation of heterotrimeric G proteins. The 
second is the capacity to lose responsiveness to ligand binding in a process 
termed desensitization which is mediated by receptor phosphorylation and the 
subsequent binding of factors that recognize the phosphorylated state of the 
receptor which prevents continued signaling. Desensitization results in an 
intrinsic limitation to drug action imposed by the action of the drug itself, i.e. . 
activation of a receptor by a hormone or drug initiates mechanisms thai 
prevent subsequent responses to repeated administration of the same agent. 
The coupled mechanisms of activation and deactivation together have been 
termed "homologous desensitization," while the inability of a dmg to maintain 
its efficacy is known as "tachyphylaxis." Even though the mechanisms 
underlying homologous desensitization have been worked out in great detail 
over the past few years, there are currently no useful pharmacological 
approaches available that prevent the inactivation mechanism. 

The potential clinical utility of agents that could prevent or 
modulate drug desensitization is enormous. Four examples where therapy is 
limited by the inability of receptors to maintain responsiveness to drugs 
' include: (i) asthma wherein desensitization of airway adrenergic receptors 
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renders epinephrine treatment ineffective after a period of hours; (ii) 
congestive heart failure wherein desensitization of adrenergic and VIP 
receptors, coupled with an elevation of the B adrenergic receptor kinase 
(0ARK), prevents the inotropic effects of endogenous regulatory hormones; 
(iii) Parkinson's disease, wherein dopamine receptor desensitization limits the 
usefulness of agents like L-Dopa; and (iv) chronic pain wherein tolerance 
results from opiate receptor desensitization. Indeed, it is difficult to conceive 
of a pharmacological modality in use today that is not limited in its 
effectiveness by the phenomenon of desensitization. 

The biochemical basis for G protein-coupled receptor desensiti- 
zation involves three classes of proteins including arrestins, kinases and G- 
proteins, all of which have been cloned fLefkowitz, Nature Biotechnology 
14:283-286 (1996)]. Following activation of a seven transmembrane receptor, 
a region is phosphorylated by one or more G protein-coupled receptor kinases 
(known as GRKs 1-6). For example, in the ^-adrenergic receptor (0AR) and 
rhodopsin. the cytoplasmic tail is phosphorylated [Premont, et al.. J. biol. 
Chem. 269:6832-6841 (1994): Freedman, etal., J. Biol. Chem. 270:17953- 
17961 (1995); Palczewski,« al.. J. Biol. Chem. 266:12949-12955 (1991); 
Palczewski, etal.. J. Biol. Chem. 270:15294-15298 (1995)] while in the m2 
muscarinic receptor, the third cytoplasmic loop is phosphorylated [Nakata, et 
al., Eur. J. Biochem. 220:29-36 (1994)]. The best characterized members of 
the family of G protein receptor kinases are the BAR kinase (/SARK) and 
rhodopsin kinase which are both membrane-associated. While rhodopsin 
kinase contains an intrinsic membrane targeting signal pnglese. et aL , Nature 
359:147-150 (1992)], BARK appears to be targeted to the membrane by 
association with G protein By subunits [Pitcher, et al. . Science 257: 1264-1267 
(1992); Inglese, et al.. Nature 359:147-150 (1992)]. Once the substrate 
receptor for each kinase is activated, presumably by ligand binding, the kinase 
associates and phosphorylates serine and threonine residues on the receptor. 
The phosphorylated receptor then becomes a binding target for one or more 
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other proteins. In the case of 0AR, for example, phosphorylation allows 
binding of arresting which prevents association with G proteins and promotes 
receptor sequestration and desensitization. Using the 0AR as an exemplary 
desensitization model, it becomes apparent that multiple steps in the pathway 
appear to provide potential points of regulation each of which is amenable to 
the split-hybrid screen to identify molecules that can block the overall 
desensitization pathway. Specifically in the case of 0AR, the split hybrid 
system can be used to identify small molecules that: (i) prevent interaction 
between 0ARK and the G protein 0 subunit; (ii) inhibit PARK activity; and 
(iii) disrupt the /SARK.arresting complex. 

A- Plasmid rnr«tn,r|i»n. 

The study of G-protein receptor kinases in the split-hybrid 
system involves three or more recombinant proteins or two or more 
recombinant proteins and a recombinant peptide library. In the split-hybrid 
system discussed above, two yeast primary expression plasmids are employed: 
pBTMl 16 [Baitel et a!.. Cellular Interactions in Development: a Practical 
Approach, (cd) Hartley, IRL Press, Oxford, pp. 153-179 (1993)]. which 
encodes the I^xA-fusion protein and the TRPI selectable marker, and pVP16 
rHollenberge, a /../l/ 0 /. CelLBioL, 15:3813-3822(1995)], which encode,; the 
VPI6-fi.sion protein and the LEW selectable marker. In order to study 
interactions involving more than two recombinant proteins in the split-hybrid 
system, however, additional selectable markers are employed. Construction 
of additional yeast expression plasmids which are used to examine interactions 
between more than two binding proteins is discussed below. 
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1. Plasmid p DRM 

A DNA fragment comprising the ADH promoter and LexA 
sites, the TetR encoding gene, the nuclear localization signal, and the ADH 
tenninator sequence are removed from pRS306/4xLexAop/ADH::TetR with 
Sad. blunt-ended, and digested with &fl. The fragment is isolated and 
ligated into pRS303/2xtetop-LYS2 which has previously been digested with 
Notl, blunt-ended, and digested with &fl. The resulting plasmid, designated 
pDRM. is integrated into the LYS2 locus in the yeast genome as described 
above, and the resulting strain designated YIDRM. Placing the repressor gene 
and selectable marker reporter gene in theLKS2 locus allows ERA3 to be used 
a selectable marker. 

2. Plasmid p RSTTPAt 

A modified version of the pRS306 vector [Sikorski er at.. 
Genetics, 122:19-27 (1989)] containing the URA3 selectable marker gene is 
also used to encode additional recombinant proteins in the split-hybrid system. 
The plasmid, pRS426, has the 2 micron origin of replication inserted into a 
unique Ami site of pRS306. Plasmid pRS426 is further modified in the 
following manner: 

(i) The ADH promoter sequence is amplified by PCR from 
BTMI 16 using primers which incorporate into the amplification product the 
DNA sequence encoding the SV40 large T antigen nuclear localization signal 
(NLS) and an initiating ATG sequence 3' to the ADH promoter. The ADH 
promoter/NLS/ATG sequence is inserted into the polylinker of P RS426. 

(n) The ADH terminator sequence is amplified by PCR from 
BTMI 16 using primers which incorporate into the product a DNA sequence 
encoding an antibody tag, for example, FLAG, hemagglutinin protein (HA), 
or thioredoxin (Thio) (FLAG, HA, and Thio antibodies are available through 
Santa Cniz Biotechnology, Santa Cruz, CA) and DNA sequences encoding 
stop codons in all three frames to the 5' end of the ADH terminator sequence. 
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The antibody tag/stop codon/ADH tenninator sequence is inserted into the 
polylinker of pRS426. 



3. Plasmid pRSADF? 

PCR is used to engineer unique restriction sites, including for 
5 example, Bg[a, Eco47UL, MM, Nhel. and SphI, immediately adjacent the 5' 
and 3' ends of the URA3 cassette in pRSURA3. The URA3 cassette is 
digested from pRSURA3 and replaced with the ADE2 cassette which is 
amplified by PCR. 



4. Plasmid pBTMII6/AD4 

A fragment containing the ADH promoter, polylinker, and 
ADH terminator is digested from pAD4 [Young et al. , Proc. Nar'l. Acad. Sci. 
(USA), S6.-7989-7993 (1989)J with BamHl, blunt-ended and inserted into the 
blunt-ended Pvul site of BTMII6 as described [Keegan et al.. Oncogene. 
72.1537-1544 (1996)], and the resulting vector designated pBTM 1 16/AD4. 
PCR is also used to engineer a nuclear localization signal 3' of the ADH 
promoter as described above. This vector contains the TRP1 selectable 
marker and can encode two recombinant proteins: (i) a LexA-fusion protein 
and (ii) a protein expressed from the pAD4 region of the vector. 



B ffARK and G Protein p Subunit Bindin g 

In a first application of the split hybrid assay, disruption of 
binding between the carboxy-terminal domain of 0ARK, containing the 
pleckstrin homology (PH) domain, and the G protein j3 subunit (G0 2 ) is 
examined. Previous work indicates that the PH domain of 0ARK interacts 
directly with the By subunits of G proteins [Pitcher, J. A., et al. Science 
257:1264-1267 (1992) andTouhara. K. etal.,J.Biol.Chem. 269:10217-10220 
(1994)]. Consistent with this observation is work by Pumiglia, et al. 
[Pumiglia, K.M., et al., J.Biol.Chem. 270:14251-14254 (1995)] which 
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indicates thai G/3 2 interacts with Raf! in yeast and that the interaction is 
disrupted by 0ARK in vitro. 

A DNA fragment containing the carboxy-terminal 222 amino 
acids (residues 467 to 689) of 0ARK1, which includes the PH domain, is 
amplified by PCR from bovine 0ARK1 [Pitcher et aL. Science, 257:1264- 
1267 (1992)] and the gel-purified amplification product is inserted into 
pBTM116. The resulting plasmid is designated LexA-COOH-0ARK. A DNA 
fragment containing the entire coding sequence of G0 2 [Fong et aL. Proc. 
Nat'l. Acad. Sci. (USA). 8*3792-3796 (1 987) J is PCR amplified from pGEM- 
HZf(-)G/? 2 pnigez-Lluhi etal., JBC. 267:23409-23417 (1992)] and the gel- 
purified amplification product inserted into pVP16. The resulting plasmid is 
designated pVP16-G0 2 . PCR is used in a similar manner to clone the 
carboxy-terminal domain of #ARK into pVPI6 and G0 2 into pBTM116. 

0ARK and G/S 2 binding is first examined in the two-hybrid 
system to determine if expression of either binding partner as a fusion protein 
in yeast affects protein/protein interaction. Binding of the two proteins is then 
examined in the split hybrid assay in order to determine if protein/protein 
interaction is capable of abolishing growth of the assay yeast strain. As 
above, addition of tetracycline and/or 3-aminotriazole required to maximize 
the difference in growth in the presence and absence of histidine is empirically 
determined. 

Split-hybrid yeast strains containing jSARK and G0 2 subunits 
are used to screen libraries of small molecules. Several types of small 
molecule libraries can be examined in the split-hybrid assay, including for 
example, chemical libraries, libraries of products naturally produced by 
microorganisms, animals, plants and/or marine organisms, combinatorial, 
recombinatoriaJ, peptidomimetic, multiparallel synthetic collection, protein, 
peptide and polypeptide libraries. A library of small peptides can be cloned 
into pRSURA3 as described [Yang et aL, Nuc. Acids Res., 23.1 152-1 156 
(1995) and Colas etal.. Nature, 580:548-550)] . Peptides corresponding to the 
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carboxy-terminus of /J ARK or other GRKs which have previously been shown 
to block calcium channel denization in intact neurons, presumably by 
blocking BASK and GB 2 binding and subsequent trafficking of 0ARK to the 
cellular membrane [Diverse-Pierluissi, etal.. Neuron 16:579-585 (1996)] can 
be identified in such a screen. Further, it is important to show that the 
molecules identified through the split hybrid selection affect BARKQB 
interaction as opposed to, for example, tetracycline analogues identified in the 
screen that would not be useful to specifically modulate BARKIGB 2 binding 



B Identification of BARK inhih;,,^ 

In a second approach, agents that directly inhibit 0ARK 
function are identified in a modification of the split-hybrid system. While 
identification of specific 0ARK inhibitors may be difficult, preliminary data 
from split hybrid assays using CREB/CBP binding partners indicates that the 
system can be used to identify serine kinase inhibitors. The serine kinase 
results also suggest several approaches can be employed in attempts to 
overcome potential problems in identifying BARK inhibitors. 

Briefly, binding between the phosphorylated G-protein coupled 
receptor (P-GR) and arresting is examined first in the standard two hybrid 
assay, followed by identification of inhibitors of P-GR/arresting binding in the 
split hybrid assay. For these studies, fragments of three G protein-coupled 
receptors are examined: the carboxy-terminal tail of B 2 AR and the third 
cytoplasmic loop of the m2 muscarinic receptor. A DNA fragment containing 
the carboxy-terminal tail of the B 2 AR (amino acids 330 to 413) is PCR 
amplified [Kolbilka e, al.. JBC. 262:7321-7327 (1987)] and the gel purified 
product inserted into pBTMl I6/Ad4 to produce a LexA-^AR fusion gene. 
The resulting plasmid is designated pBTM-0 2 AR/AD4. A DNA fragment 
containing the third cytoplasmic loop of the human m2 muscarinic receptor 
(nucleotides 268-324) is amplified from P GEX-I3m2 fHaga et al IBC 
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269. 12594-12599 (1994)] by PCR and cloned into pBTMl 16/Ad4 creating a 
LexA-m2 fusion gene. The resulting plasmid is designated pBTM-m2/AD4. 
The entire bovine /3 ARK 1 coding sequence [Benovic et al., Science, 246:235- 
240 (1989)] is PCR amplified and cloned into the polylinker region originating 
from AD4 in pBTM-/J 2 AR/AD4 and pBTM-m2/AD4. The resulting plasmids 
are designated pBTM-/3 2 AR/AD4-/JARK and pBTM-m2/AD4-0ARK, 
respectively. PCR is used to amplify the DNA fragment containing bovine 
/Sarresting-I (amino acids 1 to 437) [Lohse, et al.. Science, 248: 1547- 1550 
(1990)]. This fragment is inserted into pVP16 and is designated pVPl6- 
0arresting-l. PCR is used to amplify the DNA fragment containing rat 
0arresting-2 (amino acids 1 to 428) [Attramadal, ei al.. JBC. 267.17882- 
17890 (1992)] which is inserted intopVP16 to give plasmid pVP16-0arresting- 
2. A PCR strategy is also used to clone arresting into the pBTMl 16/AD4- 
0ARK plasmid and the /JAR and m2 fragments into pVP16. As above, the 
yeast split-hybrid Y1DRM strain is transformed with the P-GR-arresting along 
with peptide libraries (cloned into P RSURA3) or grown following 
transformation in the presence of combinatorial drug libraries. 

Inhibitors identified in the split hybrid assay should effect 
disruption of protein/protein interaction either by: (i) inhibiting 0ARK 
phosphorylation of the receptor, thus preventing recognition of the receptor 
by arresting, or (ii) by physical disruption of binding between the receptor and 
arresting. Agents that allow yeast growth for trivial reasons, i.e. , tetracycline 
analogues, can be easily identified through use of simple controls. 

A first potential problem to overcome in this study is that 
cytoplasmic /J ARK enzyme must be targeted to the substrate receptor and, 
once targeted, must phosphorylate the receptor at appropriate sites. In normal 
cells, 0y association serves to target 0ARK to the cell membrane; the 0 
subunit binds to both the 0ARK PH domain and the isoprenylated y subunit 
in association with the membrane. One possible means to encourage the 
necessary specific interactions is to target the binding components in the assay 
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by tagging the proteins with nuclear localization signals, i.e., 0ARK. the 
receptor cytoplasmic tail, and arresting, to the nucleus. The plasmids 
proposed for the study of the P-GR-arresting interaction all contain nuclear 
localization signal sequences adjacent to recombinant gene sequence. 

A second Problem is somewhat more difficult to approach. The 
current model is that receptors must be activated by ligand binding before 
being phosphorylated by 0ARK, i.e., targeting of 0ARK via 0 Y is not 
sufficient for receptor phosphorylation. There are two possible explanations 
for this requirement. The first is that phosphorylation sites on the receptor arc 
10 masked in the absence of ligand and ligand binding causes a conformational 
change which "unmasks" the phosphorylation sites. If this is the ca^e, a 
fragment of the receptor containing the immediate phosphorylation site may 
be used as the 0ARK target. However, although peptides representing 
portions of the BAR cytoplasmic tail can be phosphorylated by BARK, the. K m 
15 for the phosphorylation reaction is poor, suggesting that the kinase may 
require some other part of the receptor for binding and that the unmasking of 
this binding site by agonist is a critical step. 

This problem is addressed in two ways. In the first, the m2 
muscarinic receptor is used in place of the BAR in view of previous results 
20 which indicate that the m2 protein is a good substrate for jSARK. The third 
cytoplasmic loop of the m2 receptor serves as both the binding site and 
phosphorylation site for kinase and which should allow use of a LexA7m2 
receptor third cytoplasmic loop fusion gene as one component in the screening 
system. 

25 An alternative approach is to artificially mimic the activated 

state of the receptor. Haga, et al. [J. Biol. Chem. 269.12594-12599 (1994)] 
nave shown that the activity of 0ARK can be stimulated in vitro in the 
presence of mastoporan. a bee venom peptide. Mastoporan is believed to 
mimic the cytoplasmic face of an activated receptor and has been shown to 

30 increase the affinity of BARK for a GST-m2 receptor fusion protein by over 
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four orders of magnitude. The same effect can be seen by using peptides 
representing the flanking regions of the m2 third cytoplasmic loop. Thus, 
mastoporan should also activate 0ARK in the two-hybrid yeast strains, allow 
phosphorylation of the receptor fusion protein, and promote interaction with 
5 arresting. If mastoparan is needed, oligonucleotides containing the coding and 
non-coding nucleotide sequences of the 14-mer peptide (INLKALAALAKKIL- 
NH 2 . SEQ ID NO: 43) are annealed and ligated into prSADK. The yeast 
split-hybrid strain YIDRM is transformed with pBTM /SAR (or m2)/AD4- 
0ARK, pVP16-arresting, pRSADE2-mastoparan ) and a pRSURA3-peptide 
10 library or combinatorial drug library. 



Numerous modifications and variations in the invention as set 
forth in the above illustrative examples are expected to occur to those skilled 
in the art. Consequently only such limitations as appear in the appended 
claims should be placed on the invention. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Hoekstra, Merl F. 

(ii) TITLE OF INVENTION: Methods to Identify Compounds For 
Disrupting Protein/Protein Interactions 

(iii) NUMBER OF SEQUENCES: 4 3 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, O'Toole. Gerstein, Murray & Bo-un 

B STREET: 6300 Sears Tower, 233 South Wacker Drive 

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP: 60606-G402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version fll.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER- 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 
(A) APPLICATION NUMBER- 
<B) FILING DATE: 

(viii) ATTORNEY/ AGENT INFORMATION- 

(A) NAME : 

(B) REGISTRATION NUMBER: 

(C) REFERENCE/DOCKET NUMBER: 27866/33424 

(ix) TELECOMMUNICATION INFORMATION- 

(A) TELEPHONE: 312/4 74-6300 

(B) TELEFAX: 312/474-0448 

(C) TELEX: 2S-3856 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
TTGGTGAGCG CTAGGAGTCA CTGCCAG 
12) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 43 base pairs 

(B) TYPE.- nucleic acid 
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(C) STRANDEDNESS : single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

TATACTCTAT CAATGATAGA GTAATTCATT ATGTGATAAT GCC 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS - 
(A) LENGTH : 42 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 3: 
ATTACTCTAT CATTGATAGA GTATATAAAG TAATGTGATT TC 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i> SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
AATTCTGCTA GCCTCTGCAA AGC 
(2) INFORMATION FOR SEQ ID NO: 5: 

!i> SEQUENCE CHARACTERISTICS - 

(A) LENGTH : 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S: 
CGCACGCGTC GAAGAAATCA CATTACTTTA TATA 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CGCACGCGTA TACTAAAAAA TGAGCAGGCA AG 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS • 
<A) LENGTH : 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CGCGTACTCT ATCATTGATA GAGTA 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATGAGATAGT AACTATCTCA TGCGC 
(2) INFORMATION FOR SEQ ID NO: 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
CGCGTACTCT ATCATTGATA GAGTCTAGAC TCTATCAATG ATAGAGTA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
GCGACGCGTG CATGCCGTCT TCAAGAATTC CTCGAG 
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<2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
GCGACGCGTG CATGCCCACC GTACACGCCT ACTCGA 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CATGGCATGC AAAAAAAAAG AGTCATCCGC TAGG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION.- SEQ ID NO:13: 
CATGGCATGC TTAGCGATTG GCATTATCAC AT 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: sinqle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
TAATACGACT CACTATATAG GG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 
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(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: IS: 
TCTAGACTTT GCCTTCGTTT ATC 
(2) INFORMATION FOR SEQ ID NO: 16: 

<i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGAAGGCAAA GATGTCTAGA TTAGATAAAA G 
(2) INFORMATION FOR SEQ ID NO: 17: 

<i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 49 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 17: 

CGCGGATCCG CTTTCTCTTC TTTTTTGGAG ACCCACTTTC ACATTTAAG 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS- 
<A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AATTGCTCGA GTACTGTATG TACATACAGT AG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 



-- — -' single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 
AATTCTACTG TATGTACATA CAGTACTCGA GC 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 
<C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CCGGAATTCT CGAGACATAT C CAT ATCT AA TC 
(2) INFORMATION FOR SEQ ID NO: 21: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CCGGAATTCA CTAATCGCAT TATCATC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CATGCCATGG CCATGTCTAG ATTAGATAAA AG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCB CHARACTERISTICS • 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GCGAATTCGC CAGGGCAACA GAATGCCACT 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS- 
(A) LENGTH: 32 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGGGATCCTG GCTGGTTACC CAGGATGCCT TG 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGCGGATCCG GATGACCATG GACTCTGGAG 



(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2G: 
CGCGGATCCT TAATCTGACT TGTGGCAGTA 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGCGGATCCC CATGACCATG GAATCTGGAG CC 



(2) INFORMATION FOR SEQ ID N0:28: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CGCGGATCCG TGCTGCTTCT TCAGCAGGCT G 

(2) INFORMATION FOR SEQ ID NO:29: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) type, nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ATGGTACCAG CGGCCGCTAG TCGTTTTACA ACGTCGTGAC 
(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 
ATGGTACCGC GGCCGCTTAT TTTTGACACC AGACCAAC 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:31 : 
CGGAGATCTA AAGAGACTTT TCTCCGGAAC TCAG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.32: 

CGGAGATCTT TACAGGAAGA CTGAACTGT 
29 
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(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION : SEQ ID NO: 33: 

CCACCGCGGC AGTGCCAACC CCGATTTAC 
29 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS- 
<A) LENGTH: 2B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 
CATCCGCGGT GGTGATGGCA GGGGCTGA 



(2) INFORMATION FOR SEQ ID NO: 35: 

<i> SEQUENCE CHARACTERISTICS - 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 
GGCTATCGAT ACGGCCCCCC CGACCGAT 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GCGTATCGAT CTACCCACCG TACTCGTC 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECDLE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
CCTACTCTTA GGCCCGGGTC TTTTTAATGT ATCC 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
GGAATCACTA CAGGGATG 



(2) INFORMATION FOR SEQ ID NO: 39 ; 

<i» SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 1485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

ATGGACTTAA GAGTAGGAAG GAAATTTCGT ATTGGCAGGA AGATTGGGAG TGGTTCCTTT 60 

GGTGACATTT ACCACGGCAC GAACTTAATT AGTGGTGAAG AAGTAGCCAT CAAGCTGGAA 120 

TCGATCAGGT CCAGACATCC TCAATTGGAC TATGAGTCCC GCGTCTACAG ATACTTAAGC 180 

GGTGGTGTGG GAATCCCGTT CATCAGATGG TTTGG C AG AG AGGGTGAATA TAATGCTATG 240 

GTCATCGATC TTCTAGGCCC ATCTTTGGAA GATTTATTCA ACTACTGTCA CAGAAGGTTC 

TCCTTTAAGA CGGTTATCAT GCTGGCTTTG CAAATGTTTT GCCGTATTCA GTATATACAT 360 

GGAAGGTCGT TCATTCATAG AGATATCAAA CCAGACAACT TTTTAATGGG GGTAGGACGC 4 20 

CGTGGTAGCA CCGTTCATGT TATTGATTTC GGTCTATCAA AGAAATACCG AGATTTCAAC 

ACACATCGTC ATATTCCTTA CAGGGAGAAC AAGTCCTTGA CAGGTACAGC TCGTTATGCA 540 

AGTGTCAATA CGCATCTTGG AATAGAGCAA AGTAGAAGAG ATGACTTAGA ATCACTAGGT 60 0 

TATGTCTTGA TCTATTTITG TAAGGGTTCT TTG CCATGG C AGGGTTTGAA AGCAACCACC 660 

AAGAAACAAA AGTATGATCG TATCATGGAA AAGAAATTAA ACGTTAGCGT GGAAACTCTA 720 

TGTTCAGGTT TACCATTAGA GTTTCAAGAA TATATGG CTT ACTGTAAGAA TTTGAAATTC 780 

GATGAGAAGC CAGATTATTT GTTCTTGGCA AGGCTGTTTA AAGATCTGAG TATTAAACTA 840 

GAGTATCACA ACGACCACTT GTTCGATTGG ACAATGTTGC GTTACACAAA GGCGATGGTG 900 

GAGAAGCAAA GGGACCTCCT CATCGAAAAA GGTGATTTGA ACGCAAATAG CAATGCAGCA 960 

AGTGCAAGTA AC AGCACAGA CAACAAGTCT GAAACTTTCA ACAAGATTAA ACTGTTAGCC X020 

ATGAAGAAAT TCCCCACCCA TTTCCACTAT TACAAGAATG AAGACAAACA TAATCCTTCA 108 O 

CCAGAAGAGA TCAAACAACA AACTATCTTG AATAATAATG CAGCCTCTTC TTTACCAGAG 1140 

GAATTATTGA ACG CACTAG A TAAAGGTATG GAAAACTTGA GACAACAGCA GCCGCAGCAG 1200 

CAGGTCCAAA GTTCGCAGCC ACAACCACAG CCCCAACAGC TACAGCAGCA ACCAAATGGC 1260 

CAAAGACCAA ATTATTATCC TGAACCGTTA CTACAGCAGC AACAAAGAGA TTCTCAGGAG 1320 

CAACAGCAGC AAGTTCCGAT GGCTACAACC AGGGCTACTC AGTATCCCCC ACAAATAAAC 1380 

AGCAATAATT TTAATACTAA TCAAGCATCT GTACCTCCAC AAATGAGATC TAATCCACAA 1440 

CAGCCGCCTC AAGATAAACC AGCTGGCCAG TCAATTTGGT TGTAA 1485 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 2625 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS : sinqle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 796.. 2580 



(xi) SI 


JQOENCE DESCRIPTION: SI 


iQ ID NO.-40: 






CATTTTCTTA 


ATTCTTTTAT 


GTGCTTTTAC 


TACTTTGTTT AGTTCAAAAC 


AATAGTCGTT 


60 


ATTCTTAGGT 


ACTATAGCAT 


AAGACAAGAA 


AAGAAAAATA AGGGACAAAT 


AACATTAGCA 


120 


GAAGTACGGT 


ATATTTTACT 


GTTACTTATA 


TACTTTCAAG AAGATGAGTT 


AAATCGGTAG 


iao 


CCAGTGTAGA 


AAAATAATAA 


TAAGGGTCAT 


CGATCCTTCG CATTTTATTA 


TCCAATTAAA 




GATACGAATC 


ACGGCAAACT 


ATATTCAAAG 


CTCATAGATA ATCGTCGTAA 


GGCTGACACT 


300 


GCAGAAGAAA 


AGTCATAATT 


TGAATACTAG 


CCGGTATGAA ACTGTGATTG 


ATTAACCTGG 


360 


GGTTACCTAA 


AGAGAACATA 


AGTAATACTC 


ATGACAGAAT CAAAACACAA 


TACAAAATTT 


420 


ATCCGAACCT 


CGGCCCGACT 


GCGGCTCGCC 


GGGAAAGGGG ACAACCGCTT 


CTATCCGTCG 


480 


ACTAACTTCA 


TCGGCCCAAT 


GGAAGCTATG 


ATATGGGGAT TTCCATTGAG 


CCGATAGCAA 


540 


TGTAGGGTAA 


TACTGTTGCG 


TATATAGTGA 


TAGTTATTGA ATTTTATTAC 


CCTGCGGGAA 


600 


TATTGAGACA 


TCACTAAGCA 


CGAATTTTAC 


GTCTGAGGAA AGTTGAATGA 


TGGCCAAATA 


660 


ACCAGGAAAA 


ACAAATATTG 


AATCCTTGTG 


AAGGATTCCA CAGTTGTTTA 


ATCCTCCTTA 


720 


AGCTCACTTA 


GTATCAATTG 


TCTAAATAAT 


ATTGCTTTGA ATCTGAAAAA 


AATAAAAGTA 


780 


CCTTCGCATT 


AGACA ATG TCA CTG CCG 
Met Ser Leu Pro 


CTA CGA CAC GCA TTG GAG AAC GTT 
Leu Arg His Ala Leu Glu Asn Val 


831 



1 5 10 

ACT TCT GTT GAT AGA ATT TTA GAG GAC TTA TTA GTA CGT TTT ATT ATA 879 
Thr Ser Val Asp Arg lie Leu Glu Asp Leu Leu Val Arg Phe lie lie 
15 20 25 

AAT TGT CCG AAT GAA GAT TTA TCG AGT GTC GAG AGA GAG TTA TTT CAT 927 
Asn Cys Pro Asn Glu Asp Leu Ser Ser Val Glu Arg Glu Leu Phe His 
30 35 40 

TTT GAA GAA GCC TCA TGG TTT TAC ACG GAT TTC ATC AAA TTG ATG AAT 975 
Phe Glu Glu Ala Ser Trp Phe Tyr Thr Asp Phe lie Lys Leu Met Asn 
4 5 50 55 60 

CCA ACT TTA CCC TCC CTA AAG ATT AAA TCA TTT GCT CAA TTG ATC ATA 1023 
Pro Thr Leu Pro Ser Leu Lys lie Lys Ser Phe Ala Gin Leu lie lie 
65 70 75 

AAA CTA TGT CCT CTG GTT TGG AAA TGG GAC ATA AGA GTG GAT GAG GCA 1071 
Lys Leu Cys Pro Leu Val Trp Lys Trp Asp He Arg Val Asp Glu Ala 
80 85 90 

CTC CAG CAA TTC TCC AAG TAT AAG AAA AGT ATA CCG GTG AGG GGC GCT 1119 
Leu Gin Gin Phe Ser Lys Tyr Lys Lys Ser He Pro Val Arg Gly Ala 
95 100 105 



GCC ATA TTT AAC GAG AAC CTG AGT AAA ATT TTA TTG GTA CAG GGT ACT 
Ala He Phe Asn Glu Asn Leu Ser Lys He Leu Leu Val Gin Gly Thr 



1167 
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H5 120 

GAA TCG GAT TCT TTG TCA TTC CCA AGG GGG AAG ATA TCT AAA GAT GAA 
Glu Ser Asp Ser Leu Ser Phe Pro Arg Gly Lys He Ser Lys lip gT£ 
130 135 i4 0 

GAC ATA GAT TGT TGC ATT AGA GAA GTG AAA GAA GAA ATT GGT TTC 
Asn Asp He Asp Cys Cys lie Arg Glu Val Lys Glu Glu lie Gly ™e 
145 150 155 

GAT TTG ACG GAC TAT ATT GAC GAC AAC CAA TTC ATT GAA AGA AAT ATT 
Asp Leu Thr Asp Tyr lie Asp Asp Asn Gin Phe He Glu Arg Asn III 
160 165 170 

CAA GGT AAA AAT TAC AAA ATA TTT TTG ATA TCT GGT GTT TCA GAA GTC 
Gin Gly Lys Asn Tyr Lys He Phe Leu Xle Ser G i y ^ ™ ™ £ C 

3 180 185 

TTC AAT TTT AAA CCT CAA GTT AGA AAT GAA ATT GAT AAG ATA GAA TGG 
Phe Asn Phe Lys Pro Gin Val Arg Asn Glu He Asp Lys lie G^ 

P^o Aco £7 f* 6 ^ ^ TCT *** ACA ATG TAC ^ TCA AAT ATC AAG 

Phe Asp Phe Lys Lys lie Ser Lys Thr Met Tyr Lys Ser Asn lie Lys 
Z1 ° 2 15 220 

Sr £r ill r C ATC ATC AGA CCC ™ TCA ATG TGG TTA AGG 

Tyr Tyr Leu He Asn Ser Met Met Arg Pro Leu Ser Met Trp Leu Arg 
225 230 235 

£is G^n a™ 55* tT A ?** *** GAA GAT CAA TTG AAA TCC TAT GCG GAA 
HiS Gln ^ Ile y B ^n Glu Asp Gin Leu Lys Ser Tyr Ala Glu 

240 245 250 

r^fi AAA TTG TTG TTG GGT ATC ACT AAG GAG GAG CAG ATT GAT 

Glu Gin Leu Lys Leu Leu Leu Gly lie Thr Lys Glu Glu Gin lH 1% 
" 5 260 265 

Pro G^v irn p?" TTG CTG AAT ATG TTA CAT ACT GCA GTG CAA GCT AAC 
Pro Gly Arg Glu Leu Leu Asn Met Leu His Thr Ala Val Gin Ala Asn 
275 280 

25 j£» £n ^ S*? TCC ^ <* G GTA CCC TCG AGC CAA GAG 

Ser Asn Asn Asn Ala Val Ser Asn Gly Gin Val Pro Ser Ser Gin Glu 
290 295 300 

Leu r^n ^*A GAG CAA TCA GGA GAA CAC AAC CAA CAG AAG GAT 

Leu Gin His Leu Lys Glu Gin Ser Gly Glu His Asn Gin G^ £2 ™£ 
lvi > 310 3i 5 

a a s s 5Esaas:s22ESE 

J2W 325 33 0 

CTT TCT GAA CCG TTT GCT AAC AAT AAG AAT GTT ATA CCA CCT ACT ATP 
Leu Ser Gl u Pro Phe Ala Asn Asn Lys Asn SaT S S S £ £? 
JJ5 340 345 

Pro mI° *7 ?7 A TTC ATC TCA AAT CCT CAA TTG TTT GCG ACA ATG 

Pro Met Ala Asn Val Phe Met Ser Asn Pro Gin Leu Phe Ala Tn7 £? 

355 360 



AAT GGC CAG CCT TTT GCA CCT 



TTC CCA TTT ATG TTA CCA TTA ACT AAC 
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Asn Gly Gin Pro Phe Ala 
365 370 

AAT ACT AAT AGC GCT AAC 
Asn Ser Asn Ser Ala Asn 
385 

AAT GCT CCT CCG AAT CCG 
Asn Ala Pro Pro Asn Pro 
400 

CTT TCT GGA CCA GCA GTA 
Leu Ser Gly Pro Ala Val 
415 

TTA CCG AGG GAC TCT GGT 
Leu Pro Arg Asp Ser Gly 
430 

GAT ATA CTA AAT TCG AAA 
Asp He Leu Asn Ser Lys 
445 4 | 0 

AAG CCA AAG CTT AAA ATC 
Lys Pro Lys Leu Lys He 
465 



AAG CAA AAC AAT AAT GAT 
Lys Gin Asn Asn Asn Asp 
4 80 

CTA GAT TTG TTG AAA AAA 
Leu Asp Leu Leu Lys Lys 
495 

AAA CCA GAT ACT TCC TTT 
Lys Pro Asp Thr Ser Phe 
510 

GAT GCA GAA TAT GAA GAT 
Asp Ala Glu Tyr Glu Asp 
525 S30 

ACA GCT AGA GAT GAA AGA 
Thr Ala Arg Asp Glu Arg 
545 

GTT ATG CCA AGC GAA AAA 
Val Met Pro Ser Glu Lys 
560 

AGG AAC GAC GCA AGC AAA 
Arg Asn Asp Ala Ser Lys 
575 

ACT GTA GAA TGG GGG GCT 
Ser Val Glu Trp Gly Ala 
590 

ACAGAATCCA CAGTA 



Pro Phe Pro Phe Met Leu Pro Leu Thr Asn 
3" 3 8 o 

p^I iV S CA ACT CCG 0X0 CCC CCT AAT TTT 
Pro lie Pro Thr Pro Val Pro Pro Asn Phe 
390 

ATG GCT TTT GGT GTT CCA AAC ATG CAT AAC 
Met Ala Phe Gly Val Pro Asn Met h7s Asn 
405 410 

TCT CAA CCG TTT TCC TTG CCT CCT GCT CCT 
Ser Gin Pro Phe Ser Leu Pro Pro Ala Pro 
420 425 

TAC AGC AGC TCC TCC CCT GGG CAG TTG TTA 
Tyr Ser Ser Ser Ser Pro Gly Gin Leu Leu 
435 440 

AAG CCT GAC AGC AAC GTG CAA TCA AGC AAA 
Lye Pro Asp Ser Asn Val Gin Ser Ser Lys 
455 460 
TTA CAG AGA GGA ACG GAC TTG AAT TCA CTC 
Leu Gin Arg Gly Thr Asp Leu Asn Ser Leu 
470 475 

GAA ACT GCT CAT TCA AAC TCT CAA GCT TTG 
Glu Thr Ala His Ser Asn Ser Gin Ala Leu 
485 490 

CCA ACA TCA TCG CAG AAG ATA CAC GCT TCC 
Pro Thr Ser Ser Gin Lys He His Ala Ser 
S00 505 

TTA CCA AAT GAC TCC GTA TCT GGT ATA CAA 
Leu Pro Asn Asp Ser Val Ser Gly He Gin 
515 520 

TTC GAG ACT AGT TCA GAT GAA GAG GTG GAG 
Phe Glu Ser Ser Ser Asp Glu Glu Val Glu 
535 540 

AAT TCA TTG AAT GTA GAT ATT GGG GTG AAC 
Asn Ser Leu Asn Val Asp lie Gly Val Asn 

550 555 
GAC AGC CGA AGA AGT CAA AAG GAA AAA CCA 
Asp Ser Arg Arg Ser Gin Lys Glu Lys Pro 
565 570 

ACA AAC TTG AAC GCT TCT GCA GAA TCT AAT 
Thr Asn Leu Asn Ala Ser Ala Glu Ser Asn 
580 585 

GGG TAAATCTTCA CCCTCCGACT TCAGAGTAAC 
595 
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(2) INFORMATION FOR SEQ ID NO.-41: 

(i) SEQUENCE CHARACTERISTICS • 

(A) LENGTH: 6854 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2050.. 4053 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:4I: 
AGCTTCTCCC TTTTCCTTCA GTGCTGCTAC TCTCTGCTCT CCACTTAAGT GTTACAATTA 60 
ATTTGCAG CT AGTTTGCAGT TCGTACAACC TCGCCTATTC TTGTAACGAA GAAGAACGTA 120 
TTTATAATAT TGGGCTGTAA TGTGTTGAGT TTAGTAATAG ATAAAGTAGG ACAGAGTTCT 18 0 
GTCTTTGTTT ATCTATGGGG TTCAGAGTGA TAAGGGGCAG GATAAGGAAG TTAAAAAAAA 240 
AAAGGTTACG TTATATAACG AAAGAAAAGA AACGAGCGAA GTGCCAACTA TAGCCCAATA 300 
TCAAGAATGC AAGTCAGCAA AGTACAGTAA TCGTATGAAG ATACGCGATG CGTAATATCC 360 
CTCAAGGGCT CCGGATCAGA AAAGCTAAGG GAAGATCCTT ACATTACACG GCGTGCGACA 420 
GACTCGAACC ACAGCTAACT TCTCGTGAAA AGATGGCTTC AACTTCGCTC TTGCAATAAC 4 80 

TTTGAAACAC ACGAACAAAG GTTTATTGCG CTTGATTAAC GTTGGAAGTA TATGATACTA 54 0 

ATACTACTTT GTTCTCTAAG TCATCGCTAT ATGTTTATCT CGAGGAAAAG GTGCACGGCG 600 
GTACACAATT ACTTCGCCGT TTCGGGTAAA ACAAGTGTTA CATTTATAAT ATATATGTAT 660 
ATATGTATGT GCGCGTAAGT ATATGCCGTT CATAACAAAT CATCTTCTTG TTGCTGGATG 
GACTCCTTAA TTTTATTCAA AATGGTAATT TTCCATTTAT CTAGTCTCAT AAAATTGTCA 
AACTCCTTAC AGTGTTCGCT TAGCTGCTCG CTATCACCTT CATTAACAGC ATCGATTAAA 840 
CTTTTCAAGA AATTTGACTC CCTTGAATCC GCAAAATTCG GATCTTCACT TTGACCCTCT 900 
TGTAAAGTTC TTGCAGCAGC GACTGCATCA GTAGCAGCTA GCTGACAAAG CC CTTTTTrT 
AGGAAGTAAT CCTTCAAACT CCATTGGCTC AATCTATTGC CCATGCTGCT CTTGATCAAC 
TTCGAATATA TATCACTTGC TTCAATATAT TGACCGTCAA GAGCCTTTAG ATCTGCGCAT 
TTGATAAAAC ACTTATTCGA TAATGCTACC GACTGGTCTT GGGCATACCA CTCACCAGCG 
AGCTCATAGC AATCTATAGC TTTTGCATAG TCATGCAAAT CATTTTCTAG AATTTCTCCA 1200 
AGCTCAAACT TGAAATTAGC ACCTCTCCGG AACTGCCCCC TATGAGTAAA AATTTGAATA 1260 
GCATTTTCTA ATGAATCCAC GGCGTTCACA GAGTTTCCAC CGCTTTTAAA GCATTTATAA 
GCCTCTACGT AGGTATTTCC TGCTTCGTCT TCATTACCAG CCTTTTTCTG ATAGTCAGCA 



720 



960 
1020 
1080 
1140 



1320 
1380 
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GCTTTCAAAA ACGAGTCTCC TGCCAAGTTT AACTCTTTTC TTAGACGGTA AATGGTGGCT 
GCTTGGACAC AAAGATCAGC AGCCTCCTCA AACTTCTATG AATCAGAACC GCTAAACAAT 
TTCATGAAAC CCGATGAAGG AACACCCTTC TTCTCAGCCT TAACACAACG GGAAATATCA 
ATTCCCGTAT TTCAATGTTA GTAATTTGCC TTCGTAAATT ACGGAATCAC ATAGCTTTCA 

m-TGrrccr TTGATATATT TCCCTACTAC ATACTCTITr caataactct acagggtctc 

ACATTTTTAA CTTTCAGGTT AATGATGGTG TTCTTACTAT ATTCTCGAGT CGTACAGAAG 

TTAGTTCAGA TAAACTGCTT cggtgctgcc cacttcitat cattacttca actttacctt 

CCCTATACCT GTGTGTCCTT ATTAATTCAA GTTAATCCGA GGTAATAGAT TAGGGTAACC 
rrCAATGATG TCACGAAACA CGGATGCTGC AACTTTGCGA TTTTTTCCTG GAAAAGAATA 
ACAATTAAAG GCAGCCTTTC AGCTGAGATT ACCAGCAGGT CTTTGGAGAT TAGCGCAAGA 
AGAAGTGTGA TATAGTACTC ATAGAGGCAG GCTACAGACT AGGGAAAGCG TGTTCAACAA 
CAATAAGAA ATG GAG ACC ACT TCT TTT GAG AAT GCT CCT CCT GCA GCC 
Met Glu Thr Ser Ser Phe Glu Asn Ala Pro Pro Ala Ala 
5 10 

~ rl " - - - ^ s s - £ K £ £ 2S a 
si s z a KEsassssssss 

40 4 5 

2 SJ S §K £ iX S S? SS £ ESSE S Si 
Si K EEESS2 X S E SE S £ J" S 

ESSSJSSa-; » « s s Si Si s s 
S^SE E SESE ?E ? c T " c 08r »" ™ 

110 Y Vi? 7X1 ■*« *•* 01 V U= 

5 120 125 

552S2SSK£S5- S ££s: 

135 140 
" b 150 15 5 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2088 
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GTG GAA TAT AAA AAA ATG CTT CCC CAR GCT GAA AGA GAA AGA ATC GAG 
Val Glu Tyr Lys Lys Met Leu Pro Gin Ala Glu Arg Glu Arg lie Glu 
160 165 170 

AGG GAG AAG AGA GAG AAA AGA GGA CAR TTA GAA GAA CAA CAC AGA TCG 
Arg Glu Lys Arg Glu Lys Arg Gly Gin Leu Glu Glu Gin His Arg Ser 
175 180 185 

TCA TCT AAT CTT TCT TTG GAT TCT TTA TCT AAA ATG AGT GGA AGC GGA 
Ser Ser Asn Leu Ser Leu Asp Ser Leu Ser Lys Met Ser Gly Ser Gly 
190 195 200 205 

AAC AAT AAT ACT TCT AAC AAT CAA TTA TTC TCG ACT CTA ATG AAC GGC 
Asn Asn Asn Thr Ser Asn Asn Gin Leu Phe Ser Thr Leu Met Asn Gly 
210 215 220 

ATT AAT GCT AAT AGC ATG ATG AAC AGT CCA ATG AAT AAT ACC ATT AAC 
lie Asn Ala Asn Ser Met Met Asn Ser Pro Met Asn Asn Thr lie Asn 
225 230 235 

AAT AAC AGT TCT AAT AAC AAC AAT AGT GGT AAC ATC ATT CTG AAC CAA 
Asn Asn Ser Ser Asn Asn Asn Asn Ser Gly Asn lie lie Leu Asn Gin 
240 245 250 

CCT TCA CTT TCT GCC CAA CAT ACT TCT TCA TCG TTG TAC CAA ACA AAC 
Pro Ser Leu Ser Ala Gin His Thr Ser Ser Ser Leu Tyr Gin Thr Asn 
255 260 265 

GTT AAT AAT CAA GCC CAG ATG TCC ACT GAG AGA TTT TAT GCG CCT TTA 
Val Asn Asn Gin Ala Gin Met Ser Thr Glu Arg Phe Tyr Ala Pro Leu 
270 275 280 285 

CCA TCA ACT TCC ACT TTG CCT CTC CCA CCC CAA CAA CTG GAC TTC AAT 
Pro Ser Thr Ser Thr Leu Pro Leu Pro Pro Gin Gin Leu Asp Phe Asn 
290 295 300 

GAC CCT GAC ACT TTG GAA ATT TAT TCC CAA TTA TTG TTA TTT AAG GAT 
Asp Pro Asp Thr Leu Glu lie Tyr Ser Gin Leu Leu Leu Phe Lys Asp 
305 310 315 

AGA GAA AAG TAT TAT TAC GAG TTG GCT TAT CCC ATG GGT ATA TCC GCT 
Arg Glu Lys Tyr Tyr Tyr Glu Leu Ala Tyr Pro Met Gly lie Ser Ala 
320 325 330 

TCC CAC AAG AGA ATT ATC AAT GTT TTG TGC TCG TAC TTA GGG CTA GTA 
Ser His Lys Arg He He Asn Val Leu Cys Ser Tyr Leu Gly Leu Val 
335 340 * 345 

GAA GTA TAT GAT CCA AGA TTT ATT ATT ATC AGA AGA AAG ATT CTG GAT 
Glu Val Tyr Asp Pro Arg Phe He He He Arg Arg Lys He Leu Asp 
350 355 360 365 

CAT GCT AAT TTA CAA TCT CAT TTG CAA CAA CAA GGT CAA ATG ACA TCT 
His Ala Asn Leu Gin Ser His Leu Gin Gin Gin Gly Gin Met Thr Ser 
370 375 380 

GCT CAT CCT TTG CAG CCA AAC TCC ACT GGC GGC TCC ATG AAT AGG TCA 
Ala His Pro Leu Gin Pro Asn Ser Thr Gly Gly Ser Met Asn Arg Ser 
385 390 395 

CAA TCT TAT ACA AGT TTG TTA CAG GCC CAT GCA GCA GCT GCA GCG AAT 
Gin Ser Tyr Thr Ser Leu Leu Gin Ala His Ala Ala Ala Ala Ala Asn 
400 405 410 
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AGT ATT AGC AAT CAG GCC GTT AAC AAT TCT TCC AAC AGC AAT ACT ATT 
Ser He Ser Asn Gin Ala Val Asn Asn Ser Ser Asn Ser Asn Thr He 
415 420 425 

AAC AGT AAT AAC GGT AAC GGT AAC AAT GTC ATC ATT AAT AAC AAT AGC 
Asn Ser Asn Asn Gly Asn Gly Asn Asn Val He He Asn Asn Asn Ser 
430 435 440 445 

GCC AGC TCA ACA CCA AAA ATT TCT TCA CAG GGA CAA TTC TCC ATG CAA 
Ala Ser Ser Thr Pro Lys He Ser Ser Gin Gly Gin Phe Ser Met Gin 
4S0 455 460 

CCA ACA CTA ACC TCA CCT AAA ATG AAC ATA CAC CAT AGT TCT CAA TAC 
Pro Thr Leu Thr Ser Pro Lys Met Asn He His His Ser Ser Gin Tvr 
465 470 475 

AAT TCC GCA GAC CAA CCG CAA CAA CCT CAA CCA CAA ACA CAG CAA AAT 
Asn Ser Ala Asp Gin Pro Gin Gin Pro Gin Pro Gin Thr Gin Gin Asn 
480 485 490 

GTT CAG TCA GCT GCG CAA CAA CAA CAA TCT TTT TTA AGA CAA CAA GCT 
Val Gin Ser Ala Ala Gin Gin Gin Gin Ser Phe Leu Arg Gin Gin Ala 
495 500 505 

ACT TTA ACA CCA TCC TCA AGA ATT CCA TCC GGT TAT TCT GCC AAC CAT 
Thr Leu Thr Pro Ser Ser Arg He Pro Ser Gly Tyr Ser Ala Asn His 
510 515 520 525 

TAT CAA ATC AAT TCC GTT AAT CCC TTA CTG AGA AAT TCT CAA ATT TCA 
Tyr Gin He Asn Ser Val Asn Pro Leu Leu Arg Asn Ser Gin He Ser 
530 535 540 

CCT CCA AAT TCA CAA ATC CCA ATC AAC AGC CAA ACC CTA TCC CAA GCG 
Pro Pro Asn Ser Gin He Pro He Asn Ser Gin Thr Leu Ser Gin Ala 
545 550 555 

CAA CCA CCA GCA CAG TCC CAA ACT CAA CAA CGG GTA CCA GTG GCA TAC 
Gin Pro Pro Ala Gin Ser Gin Thr Gin Gin Arg Val Pro Val Ala Tvr 
560 565 570 

CAA AAT GCT TCA TTG TCT TCC CAG CAG TTG TAC AAC CTT AAC GGC CCA 
Gin Asn Ala Ser Leu Ser Ser Gin Gin Leu Tyr Asn Leu Asn Gly Pro 
575 580 585 

TCT TCA GCA AAC TCA CAG TCC CAA CTG CTT CCA CAG CAC ACA AAT GGC 
Ser Ser Ala Asn Ser Gin Ser Gin Leu Leu Pro Gin His Thr Asn Gly 
590 595 600 6 05 

TCA GTA CAT TCT AAT TTC TCA TAT CAG TCT TAT CAC GAT GAG TCC ATG 
Ser Val His Ser Asn Phe Ser Tyr Gin Ser Tyr His Asp Glu Ser Met 
610 615 620 

TTG TCC GCA CAC AAT TTG AAT AGT GCC GAC TTG ATC TAT AAA TCT TTG 
Leu Ser Ala His Asn Leu Asn Ser Ala Asp Leu He Tyr Lys Ser Leu 
62 5 630 635 

AGT CAC TCT GGA CTA GAT GAT GGC TTG GAA CAG GGC TTG AAT CGT TCT 
Ser His Ser Gly Leu Asp Asp Gly Leu Glu Gin Gly Leu Asn Arq Ser 
640 645 650 

TTA AGC GGA CTG GAT TTA CAA AAC CAA AAC AAG AAG AAT CTA TCG 
Leu Ser Gly Leu Asp Leu Gin Asn Gin Asn Lys Lys Asn Leu Trp 
655 660 665 
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TAATATATAC TTCCATTATT CTATGATTAT AGAGTTTGTT TGGTATTTGT ATATCGCACG 4113 

ATACAAGTAA TGAGGGGTGC TTACACAAGA TAAAAGATAA AAAAATATAT ATATATAATA 4173 

AAAACCATCA AAAACACCAT TGAAAAAAAA TATAAAAAAA AAAAAAAATA ACCGAATATG 4233 

AATATGAAAT TAATGATCAT GATGAAGTTA ATTTTTACTG AGAAACGTCA CCTAATGTCG 4293 

ATGAAACGAT GATAATGAAT GAATGATGAG GCTACTTTAA GTAACGCAAT GTAATCAAGC 4 3S3 

CAAAATTATC CCTCTTTTTT TTTTTTCCCT CTTTTGAGAT TTTATTTTTA ACCTACTACT 4 413 

TA CTTTTTTT TTTTGAACGT TCTTTTCCCA CATACTTTTA TATATGGTAT TTATATGTAC 44 73 

GATGTTTAAT CACAGAGATG TTTCTACCTT ACTCGATATT GTTTTTGCAT TAATTGATAT 4533 

CTTGCTCACT GCATCATTGG CGGTATTTGT AGTATATAGA AAGTCGGGTA ACAATAATTT 4593 

ATTGACATTT CTTTGTTTAC AATGATCAGA GAAGAGCAGA AAGTTTCATA GTCAAACGT" 4 653 

CAGGCCAATT GAACAAGAAA TTATTCGTTT TTTTAGTCGT TGAGTGTTCA ACTGACATGC 4713 

TATTTTGGTG GTTCTTGATT AATTGGGGGC TTCATTGTTT GAAATAAAGA GTCGGGAAAA 4773 

TAGCACAGAA ACAAAGCATA TTAAAAGAGG CAAAAGAAGA AAGAACGAAT ATAAAAGGTA 4833 

AAAAAGGAAA AGCATTGCTA TTCTTTTCTC AT AG GTGTTA TTCATACCGC CCTCTCTCTT 4 89 3 

CTTCCTTCTT CATTAATTAG TCTCCGTATA ATTTGCAGAT AATGTCATTA ACAGCAAACG 4953 

ACGAATCGCC AAAACCCAAA AAAAATGCAT TATTGAAAAA CTTAGAGATC GATGATCTGA 5013 

TACATTCTCA ATTTGTCAGA AGCGATACAA ATGGACATAG AACTACAAGA CGACTATTCA 5073 

ACTCCGATGC CAGTATATCA CATCGAATAA GAGGAAGTGT TCGGTCTGAT AAAGGCCTTA 5133 

ATAAAATAAA AAAAGGGTTG ATTTCCCAGC AGTCCAAACT TGCGTCAGAA AATTCTTCTC: 5193 

AAAATATCGT TAATAGGGAC AATAAGATGG GAGCAGTAAG TTTCCCCATT ATTGAACCTA 5253 

ATATTGAAGT CAGCGAGGAG TTGAAGGTTA GAATTAAGTA TGATTCTATC AAATTTTTCA 5313 

ATTTTGAAAG ACTAATATCT AAATCTTCAG TCATAGCACC TTTAGTTAAC AAAAATATAA 5373 

CATCATCCGG TCCTCTAATC GGGTTTCAAA GAAGAGTTAA CAGGTTAAAG CAAACATGGG 5433 

ATCTAGCAAC CGAAAACATG GAGTACCCAT ATTCTTCTGA TAATACG CCA TTCAGGGATA 549 3 

ACGATTCTTG GCAATGGTAC GTACCATACG GCGGAACAAT AAAAAAAATG AAAGATTTCA 5553 

GTACAAAAAG AACTTTACCC ACCTGGGAAG ATAAAATAAA GTTTCTTACA TTTTTAGAAf. 5613 

ACTCTAAGTC TGCAACGTAC ATTAATGGTA ACGTATCACT TTGCAATCAT AATGAAACCC- 5673 

ATCAAGAAAA CGAAGATAGG AAAAAAAGGA AAGGGAAAGT ACCAAGAATC AAAAATAAAC- 5733 

TGTGGTTTTC CCAGATAGAA TACATTGTTC TTCGAAATTA TGAAATTAAA CCTTGGTATA 5793 

CATCTCCTTT TCCGGAACAC ATCAACCAAA ATAAAATGGT TTTTATATGT GAGTTCTGCC 5853 

TAAAATATAT GACTTCTCGA TATACTTTTT ATAGACACCA ACTAAAGTGT CTAACTTTTA 5913 

AGCCCCCCGG AAATGAAATT TATCGCGACG GTAAGCTGTC TGTTTGGGAA ATTGATGGGC 5973 
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GGGAGAATGT CTTGTATTGT CAAAATCTTT GCCTGTTGGC AAAATGTTTT ATCAATTCTA 6033 

AGACTTTGTA TTACGATGTT GAACCGTTTA TATTCTATAT TCTAACGGAG AGAGAGGATA 6093 

CAGAGAACCA TCCCTATCAA AACGCAGCCA AATTCCATTT CGTAGGCTAT TTCTCCAAGG 6153 

AAAAATTCAA CTCCAATGAC TATAACCTAA GTTGTATTTT AACTCTACCC ATATACCAGA 6213 

GGAAAGGATA TGGTCAGTTT TTGATGGAAT TTTCATATTT ATTATCCAGA AAGGAGTCAA 6273 

AATTTGGAAC TCCTGAAAAA CCATTGTCGG ATTTAGGATT ATTGACTTAC AGAACGTTTT 63 33 

GGAAGATAAA ATGTGCTGAA GTGCTATTAA AATTAAGAGA CAGTGCTAGA CGTCGATCAA 6393 

ATAATAAAAA TGAAGATACT TTTCAGCAGG TTAGCCTAAA CGATATCGCT AAACTAACAG 6453 

GAATGATACC AACAGACGTT GTGTTTGGAT TGGAACAACT TCAAGTTTTG TATCG CCATA 6S13 

AAACACGCTC ATTATCCAGT TTGGATGATT TCAACTATAT TATTAAAATC GATTCTTGGA 6573 

ACAGGATTGA AAATATTTAC AAAACTTGGA GCTCAAAAAA CTATCCTCGC GTCAAATATG 6633 

ACAAACTATT GTGGGAACCT ATTATATTAG GGCCGTCATT TGGTATAAAT GGGATGATGA 6693 

ACTTAGAACC CACCGCATTA GCGGACGAAG CTCTTACAAA TGAAACTATG GCTCCGGTAA 6753 

TTTCGAATAA CACACATATA GAAAACTATA ACAACAGTAG AGCACATAAT AAACGCAGAA 6813 

GAAGAAGAAG AAGAAGTAGT GAGCACAAAA CATCCAAGCT T 6854 

(2) INFORMATION FOR SEQ ID NO: 42: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2814 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

H C I AC £5° 00 07X3 CAT CCT 600 CCC AAA TCC 48 

Glu Phe Gin Tyr Thr Lys Gin Leu His Phe Pro Val Gly Pro Lys Ser 
1 5 10 15 

ACA AAC TGT GAG GTA GCG GAA ATT CTT TTA CAC TGC GAC TGG GAA AGO « 
Thr Asn Cys Glu Val Ala Glu He Leu Leu His Cys Asp Trp Glu Arg 
20 25 30 

TAC ATA AAT GTT TTA ACT ATA ACA AGA ACA CCA AAT GTT CCT AGT GGT 144 
Tyr He Asn Val Leu Ser He Thr Arg Thr Pro Asn Val Pro Ser Gly 
35 40 45 

ACC AGT TTC AGC ACC AGA ACG AGG TAC ATG TTC CGA TGG GAT GAC CAG 192 
Thr Ser Phe Ser Thr Arg Thr Arg Tyr Met Phe Arg Trp Asp Asp Gin 
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GGG CMGGTTGC ATA TTA AAA ATA ACT TTT TGG GTG GAC TGG AAC GCA 
Gly Gin Gly Cys He Leu Lys He Ser Phe Trp Val Asp Trp Asn Ala 

75 B0 

TCC AGT TGG ATC AAG CCA ATG GTA GAG AGC AAT TGT AAA AAT GGA CAA 
Ser Ser Trp He Lys Pro Met Val Glu Ser Asn Lys Asn Gly G^n 

85 90 95 

tTI ^ C A" GAC TTO GTA AAG TTA GTC GAA GAA TTT GTA GAG 

He Ser Ala Thr Lys Asp Leu Val Lys Leu Val Glu Glu Phe Val Glu 
100 1Q 5 110 

AAA TAC GTG GAA TTG AGC AAA GAA AAA GCA GAT ACA CTC AAG CCG TTT 
Lys Tyr Val Glu Leu Ser Lys Glu Lys Ala Asp t£ 2S t£ Pro 2S 

120 125 
CCC AGT GIT ACA TCT TTT GGA TCA CCT AGG AAA GTG GCA GCA CCG GAG 
Pro Ser Val Thr Ser Phe Gly Ser Pro Arg Lys Val Ala A^ Pro Glu 
■••■»" 135 140 

CTG TCG ATG GTA CAG CCG GAG TCG AAA CCA nns r«~r rnr r-r-^ 

Leu Ser Met Val Gin Pro Glu Ser i£ S£ £1 K S! IT, Glu S 
150 155 16Q 



TCA GAA ATA GGC AGC GAC AGA TGG AGG TTT AAC TGG GTG AAC ATA ATA 
Ser Glu lie Gly Ser Asp Arg Trp Arg Phe Asn Trp Val Asn lie i™ 
165 170 175 

ATC TTG GTG CTC TTG GTG TTA AAT CTG CTG TAT TTA ATG AAG Tm ,»r 
lie Leu Val Leu Leu Val Leu Asn Leu Leu Tyr Leu Met Lys Leu £n 

180 185 190 

AAG AAG ATG GAT AAG CTG ACQ AAC CTC ATG ACC CAC AAG GAC GAA GTT 
Lys Lys Met Asp Lys Leu Thr Asn Leu Met Thr His Lys Asp Glu VaT 
135 200 205 

vll Ala t£ lIu IT ^ tT A CCA GCC CAA GTA CAA TGG TCA 

210 ^5 * 2 1 " G1 " 

Arg Pro Arg Arg G^y Asp Val Leu ™ CAGAGTA ATCATGTAAT ATTGTATGTA 
22 5 230 

AGGTTATGTA TGTTCGTATG GTATGGAAAA AAAAAAAAAA AAAGGATGCT ATGTGGAGAA 
TGTAAGGCGT GGTAGCTCCG GATAATTCAG TCTGTAGGCT TCATCACGGG CAGTGGCCTG 
ACTCTGAGAG CTTGCTCCGG TATTAAGTTG TGCGTTTGAA ATTTTCTGGA AAAAAGAAAT 
TGATTGGTTG AAGCTATACT CGTCGAAAGA TTTCTTCGGC AGTGGTTGTT GCTCCACCTG 
CACGGGAGTT GTGTTTGCGT TTATGTTCGG CTTGGCTATA TTATTAGCGA GTGATGTTTG 
CAATTTGCTG TATTGAGAAT CAATTTGGGT GCGTAAGCTT TCAATAATTT TGCAGACCGC 
AGGCACTTCC AACTTTATGA GTTGCAGGTA TTCTCTTTTA TGAATATACG ATGACGACGA 
TGACGACGAC GCATCCATGC GCAAAAGCTC AGGGTGTCTA GATA G T TTGT TAGTCAATAA 
ATCCACATAT CTAAAATAAT AAATAAACGA CAGCGACAAG TCGTTGGCCT GGAACGCACA 
CTGTGCCTTT TCCAATATGC CGATGCATGT TTTCAGGTAA ATTCTCAATG GTATCGCCGG 



786 
846 
906 
966 
1026 
1086 
1146 
1206 
1266 
1326 



WO 98/13502 



PCT/US97/17276 



- 89 - 

ATTGAAGCGA TAATCCTTAG CGTCCTGAAC CAATTGCTTA CTAGACTTCA TGACCTACCG 
GGGCCAGATA AAGATGCGGA AGGAAGAGAA AAAATGTATA GTGGTTGGTG AACCGCAACA 
ATAATTCGTG CCAACACTTT AATCGAAGCA AAAATTGTCT TGTATGTTAT TAATATTATC 
TATCTAACCA TTGATTTACG TATAAAACTG TCGATCCTCA TCGCCTAGCA ATGAAAAAAT 

i rrrrGTTTT tittttcatt atttctcttt gttgcgtact ttttttcatt gcgtttcgcg 

GCAAAAGCGA TTCGAGTTGA CTGGAAGTGT GTTATACTAT AAAAAGTGTA TATGCCTATT 
TTTGGTTCTG ATCTTTACTT tactgttaag TACTGGCTGA GGCAGTAGAC TCTGCCTCTG 
TTACGGCAGC GGTATTCGCC TCGGCATCAG CAGCCGCCCA CGGTAGAGTA GGTTCTGTTG 
TTTTGACGTT TGCCAAGGTA CTGTCCAAAT GCTCCTTCAG CAAGGCCTCA TTACTTTCCT 
TCTCCGGACC CACCGATTGC GTGATCTCCT GTACACGGTT CAAGAACTTG TTCAAATTGT 
AGCCCGCAGC AGCATCAGAG ACTTCTTGTG TGTAAGGGAC ACCCCTCAAC TCCTTGACTC 
TTCTTTTGTG CACTTTGCCC TTTAAATGCG TTTTTAACGC TATAGCAGTC TCCATGTATT 
TGGCACAGTG TATGCAATAG TGCTGACCAA GGCCCGGTTT GGTTTCATCC AATGGCTGGT 
TCAGAAGCTT CTGTACTGAT TCCTTGGTGG ACAAATCGTT ATAGATCAGG TCCAAGTCTC 
GTGTTCTTCT TTTAGTCTTG TATCTCTTCA CCGAATATCT ACCCATGATG CGCTATTGTT 
TTATCTTCAC TTGTCTGTGT GTTTAACTGC CTTTCAATTC ACCTCATCTC ATCTCCCGCT 
ACTTTCCATA TATAAAAGCA AAATTAATTT GCTTTTTCCC CTGTCAGTAT AAAAAAATTT 
TCCGCAGGAT ATAGAAAAAA AAGAAATGAA ATTATAGTAG CGGTTATTTC CGTGGGGTGC 
TTTTTTACAC CTGTACATCT TTTCCCTCCG TACATTTTTT TTATTTTTTT TTTGGGTTTT 

TPrrrrrcGA tatttttccc tccgaaacta gttagcacaa taatgctgac taaggaaact 

TTTCATCTCA GAATTGATGG TCAGTTTGGT TTCTCTAGAG AATAG TTTAT AAAAAGATGT 
TGATGTGGAG CAACCATTTA TACATCCTTT CCGCAAGTGC TTTTGGAGTG GGACTTTCAA 
ACTTTAAAGT ACAGTATATC AAATAACTAA TTCAAGATGG CTAGAAGACC AGCTAGATGT 
TACAGATACC AAAAGAACAA GCCTTACCCA AAGTCTAGAT ACAACAGAGC TGTTCCAGAC 
TCCAAGATCA GAATCTACGA TTTGGGTAAG AAGAAGGCTA CCGTCGAT 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

lie Aan Leu Lye Ala Leu Ala Ala Leu Ala Lys Lys lie Leu 



1386 

1446 

1506 

1S66 

1626 

1686 

1746 

1806 

1866 

1926 

1986 

2046 

2106 

2166 

2226 

2286 

2346 

2406 

2466 

2526 

2586 

2646 

2706 

2766 

2814 
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WHAT IS CLAIMED IS: 

1- A host cell transformed or transfected with DNA 

comprising: 

a repressor gene encoding a repressor protein, said 
repressor gene under transcriptional control of a promoter; 

a selectable marker gene encoding a selectable marker 
protein; said selectable marker gene under transcriptional 
control of an operator; said operator regulated by interaction 
with said repressor protein; 

a first recombinant fusion protein gene encoding a first 
binding protein or binding fragment thereof in frame with 
either a DNA binding domain of a transcriptional activating 
protein or a transacting domain of a transcriptional 
activating protein; and 

a second recombinant fusion protein gene encoding a 
second binding protein or binding fragment thereof in frame 
with either a DNA binding domain of a transcriptional 
activating protein or a transactivating domain of a 
transcriptional activating protein, whichever domain is not 
encoded by the first fusion protein gene, said second binding 
protein or binding fragment thereof capable of interacting with 
said first binding protein or binding fragment thereof such that 
interaction of said second binding protein or binding fragment 
thereof and said first binding protein or binding fragment 
thereof brings into proximity a DNA binding domain and a 
transactivating domain forming a functional transcriptional 
activating protein; said functional transcriptional activating 
protein acting on said promoter to increase expression of said 
repressor gene. 



WO 98/13502 



PCT/US97/I7276 



-91 - 

2. The host cell of claims I wherein said DNA binding 
domain and said transactivating domain are derived from a common 
transcriptional activating protein. 

3. The host cell of claim I wherein one or more of the 
repressor gene, the selectable marker gene, the first recombinant fusion 
protein gene, and the second recombinant fusion protein gene arc encoded on 
distinct DNA expression constructs. 

4. The host cell of claim 1 wherein said selectable marker 
protein is an enzyme in a pathway for synthesis of a nutritional requirement 
for said host cell such that expression of said selectable marker protein is 
required for growth of said host cell on media lacking said nutritional 
requirement. 

5. The host cell of claim 1 wherein said host cell is a yeast cell 
or a mammalian. 

6. The host cell of claim 2 wherein said selectable marker gene 

encodes HIS 3; 

7. The host cell of claim 2 wherein said repressor protein gene 
encodes a tetracycline resistance protein; 

8. The host cell of claim 2 wherein said operator is a let 

operator. 

9. The host cell of claim 2 wherein said promoter is selected 
from the group consisting of the LexA promoter, the alcohol dehydrogenase 
promoter, the Gal4 promoter. 
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10. The host cell of claim 2 wherein said DNA binding domain 
denved from a protein selected from the group consisting of LexA and Ga»4. 

11. The host cells of claim 2 wherein said transactivating 
domain is derived from a protein selected from the gro Up consisting of VP16 
and Gal4. 



12. The host cell of claim 2 wherein the first binding protein 
is CREB and the second binding protein is CBP. 

13. The host cell of claim 2 wherein the first binding protein 
is Tax and the second binding protein is SRF. 

14. The host cell of claim 2 wherein the first binding protein 
is casein kinase I and the second binding protein is CREB. 

15. The host cell of claim 2 wherein the first binding protein 
is AKAP 79 and the second binding protein is selected from the group 
consisting of RI, rjj and calcineurin. 

16. A method to identify an inhibitor of binding between a first 
binding protein or binding fragment thereof and a second binding protein or 
binding fragment thereof comprising the steps of: 

a) growing host cells of any one of claims 1 through 15 in 
the absence of a test compound and under conditions 
which permit expression of said first binding protein or 
binding fragment thereof and said second binding 
protein or binding fragment thereof such that said first 
binding protein or fragment thereof and second binding 
protein or binding fragment thereof interact bringing 
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into proximity said DNA binding domain and said 
transactivating domain forming said functional 
transcriptional activating protein; said transcriptional 
activating protein acting on said promoter to increase 
expression of said repressor protein; said repressor 
protein interacting with said operator such that said 
selectable marker protein is not expressed; 

b) confirming lack of expression of said selectable marker 
protein in said host cell; 

c) growing said host cells in the presence of a test 
compound; and 

d) comparing expression of said selectable marker protein 
in the presence and absence of said test compound 
wherein increased expression of said selectable marker 
protein is indicative that the test compound is an 
inhibitor of binding between said first binding protein or 
binding fragment thereof and said second binding 
protein or binding fragment thereof. 

17. The method of claim 16 wherein 
the host cell is a yeast cell; 
the selectable marker gene encodes HIS3; 
transcription of the selectable marker gene is regulated 
by the let operator; 

the repressor protein gene encodes the tetracycline 
resistance protein; 

transcription, of the tetracycline resistance protein is 
regulated by the LexA promoter; 
the DNA binding domain is derived from LexA; and 
the transactivating domain is derived from VP 16. 
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18. The method of claim 16 wherein 
the host ceil is a yeast cell; 
the selectable marker gene encodes HIS3; 
transcription of the selectable marker gene is regulated 
by the tet operator; 

the repressor protein gene encodes the tetracycline 
resistance protein; 

transcription of the tetracycline resistance protein is 
regulated by the alcohol dehydrogenase promoter; 
the DNA binding domain is derived from LexA; and 
the transactivating domain is derived from VPI6. 

19. A kit to identify an inhibitor of binding between a first 
binding protein or binding fragment thereof and a second binding protein or 
binding fragment thereof, said inhibitor identified by the method of claim 16. 
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1. Claims: 1-12, 16-19 

A host cell transformed or transfected with DNA comprising: 
a repressor gene encoding a repressor protein, said 
repressor gene under transcriptional control of a promoter; 
a selectable maker gene encoding a selectable protein; said 
marker gene under transcriptional control of an operator; 
said operator regulated by interaction with said repressor 
protein; a first recombinant fusion protein gene encoding a 
first binding potein or binding fragment thereof in frame 
with either a transactivator domain of a transcriptional 
activator protein; and a second recombinant fusion protein 
or binding fragment thereof in frame with either a DNA 
binding domain of a transcriptional activating protein, 
whichever domain is not encoded by the first fusion protein 
gene, said second binding protein or binding fragment 
thereof capable of interacting with said first binding 
protein or binding fragment thereof such that interaction of 
said second binding protein or binding fragment thereof 
brings into proximity a DNA binding domain and a 
transactivating domain forming a functional transcriptional 
activating protein; said functional transcriptional 
activating protein acting on said promoter to increase 
expression of said repressor gene; said DNA binding domain 
and said transactivating domain are derived from a common 
transcriptional activating protein; one or more of the 
repressor gene, the selectable marker gene, and the first 
and second recombinant fusion protein genes, are encoded on 
distinct DNA expression constructs; said host cell wherein 
said selectable marker protein is an enzyme; said host cell 
is a yeast cell or a mammalian; said selectable marker gene 
encodes HIS3, said repressor protein gene encodes a 
tetracyline resistant protein; said operator is a tet 
operator; said promoter is selcted from the group consisting 
of the LexA- , the alcohol dehydrogenase-, the GAl4-promoter; 
said DNA binding domain derived from a protein from the 
group consisting of LexA and Gal 4; said transactivating 
domain is derived from a protein selected from the group 
consisting of VP16 and Gal4; said first binding protein is 
CREB and the second binding protein is CBP; a method and kit 
to identify an inhibitor of binding between a first and a 
second binding protein using said host cell. 



2- Claim : 13 

The host cell of subject one but wherein the first binding 
protein is Tax and the second binding protein is SRF. 
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3. Claim : 14 



The host cell of subject one but wherein the first binding 
protein is casein kinase I and the second binding protein is 



The host cell of subject one but wherein the first binding 
protein is AKAP 79 and the second binding protein is 
selected from the group consisting RI, RII and calcineurin. 
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